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Abstract 

Many classical problems of computer science — such as pagingFschedulingFand maintaining 
dynamic data structures — are naturally on-line; an algorithm for such a problem is constantly 
making irrevocable decisions without knowing what its future input will be. The competitive 
analysis of on-line algorithms was brought into prominence by the work of Sleator and Tarjan 
in 1985 as a theoretical framework in which to measure the performance of such algorithms. 
Since thenFa variety of on-line problems have been studied from this perspective. 

We consider two such problems in this setting: robot navigationr and computationally 
efficient algorithms for the A;-server problem of ManasserMcGeochrand Sleator. For the first 
of theserwe give algorithms for a robot searching for a goal in an unknown simple polygon; our 
approach can be viewed as an extension of some of the strategies of Baeza-YatesrCulbersonr 
and Rawlins to a more general class of geometric search spaces. We also provide the first 
competitive analysis of the robot localization problem — a fundamental task for an autonomous 
mobile robotTin which it must determine its location in a known environment. 

Finallyrwe address the general question of how the performance of an on-line algorithm 
varies with the computational resources it has at its disposal. Within the context of the A;-server 
problemra natural specialization of this question is the following: can an algorithm which uses 
only constant spaceFand constant time in processing each requestTmatch the performance of 
algorithms which are allowed to perform an arbitrary amount of computation at each step? We 
show a non-trivial lower bound for the class of 2-server balancing algorithmsFa subclass of the 
constant-time algorithms; this represents one of the first lower bounds for an on-line algorithm 
based solely on its computational resources. 

Thesis Supervisor: Michel X. Goemans 

Title: Assistant Professor of Applied Mathematics 
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Chapter 1 



Introduction 



The distinction between on-line and off-line algorithms is an old one in computer scienceFyet 
the current surge of on-line algorithms research in the theory community is relatively new. In 
large part this is due to the surprisingly rich structure of the competitive ratioT brought into 
focus by Sleator and Tarjan [ST] in 1985 as a means of analyzing on-line algorithms. 

At a general leveiran on-line algorithm is one which receives its input in a piecemeal fashion; 
at each stepUt must make irrevocable decisions before seeing the remainder of the input. An off- 
line algorithmron the other handrhas the luxury of reading the entire input before performing 
any computation. The competitive ratio of an on-line algorithm is then defined to be the worst- 
case ratio of the cost it incurs on a given problem instance to the cost of the optimum (off-line) 
solution. (Readers not familiar with the definitions in this area should read Chapter 2 before 
this Introduction.) 

In this thesisrwe consider two naturally on-line problems within the framework of compet- 
itive analysis: robot navigationFand the A;-server problem. 

In a typical on-line navigation problemFa robot with vision is placed in a geometric envi- 
ronment for which it does not have a map; it must perform some task in this environment Tsuch 
as constructing a mapFor finding a recognizable goal hidden in the environment. Baeza-Yatesr 
CulbersonFand Rawlins [BCR]Tin one of the first papers in this areaFconsidered the latter type 
of search problem. They gave algorithms for searching in a number of structured situationsr 
including the problem of searching for a point on a liner a line in the planer and a point in a 
grid graph. The basis for these algorithms is a collection of techniques that have found wide 
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application in subsequent papers. 

In Chapter 3rwe consider more geometric versions of this search problem — specifically the 
problem of a robot searching for a goal in an unknown rectilinear polygon. This differs from 
the model of [BCR] in that the structure of the space to be searched is not known to the robot; 
the generality of rectilinear polygons also makes possible a much wider class of search spaces. 
The search algorithm we give is fairly naturair performing exploration on larger and larger 
regions of the polygon. This problem of exploration in a simple rectilinear polygon has been 
considered separately by DengFKamedaFand Papadimitriou [DKP]Twho gave an algorithm 
with a competitive ratio of 2; we offer a simple randomized variant with competitive ratio 5/4. 

It is also interesting to consider special cases of the search problem in a polygon — re- 
stricting the type of polygon so that better competitive ratios can be obtained. This was the 
approach taken by Klein [K]Twho considered the class of streets and gave a search algorithm 
with competitive ratio at most 1 + |7r (~ 5.71). We give an algorithm for streets with compet- 



itive ratio at most y 4 + V8 (~ 2.61)rimproving on this bound by more than a factor of two. 
A number of other types of polygons may well be amenable to the same type of approachrand 
this appears to be an interesting direction for further work. 

Not all on-line navigation problems assume a complete lack of information about the envi- 
ronment r however. For exampleFa fundamental task for an autonomous mobile robot is that 
of localization — determining its location in a known environment. This problem arises in set- 
tings that range from the computer analysis of aerial photographs to the design of autonomous 
Mars rovers. Typicallyriocalization occurs in two phases: first the robot determines all possible 
locations consistent with its current local view of the environment; then it moves aroundrper- 
forming enough "reconnaissance" to determine uniquely where it is. Despite the attention that 
localization has received in the robotics literaturerthe first theoretical work on it appeared only 
recently in a paper of GuibasFMotwaniFand Raghavan [GMR]; they gave geometric algorithms 
for the first part of the problem — enumerating locations consistent with a view — but did not 
address the issue of strategies for the second part. 

In Chapter 4Fwe consider this aspect of localizationFproviding algorithms for a mobile robot 
to move around in its environment so as to determine efficiently where it is. We argue that 
localization is a natural setting in which to apply the competitive analysis of on-line algorithms; 
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the distance traveled by the robot is compared to the length of the shortest possible localizing 
tour. The search strategies of [BCR] allow one to obtain a fairly straightforwardrand non- 
optimairiocalization algorithm; our main result here is an algorithm which makes stronger use 
of the robot's knowledge of the map in order to improve asymptotically on this approach. An 
interesting feature of our technique is the way in which the robot is able to identify "critical 
directions" in the environment which allow it to perform late stages of the search more efficiently. 

Finallyrin Chapter 5rwe consider some problems related to the computational resources 
required by an on-line algorithm. The overwhelming majority of research in competitive analysis 
has proceeded along information-theoretic lines; that isrthe only factor limiting the performance 
of an on-line algorithm is its lack of information about the nature of future input. But for any 
given problemra natural question to ask is whether the best competitive ratio attainable by an 
on-line algorithm depends on the computational resources it has at its disposal. We consider 
such questions within the context of the A;-server problem — an elegant generalization of paging 
and certain types of scheduling problemsFand one of the most well-studied problems in the area 
of on-line algorithms. HereFan algorithm is presented with a metric space M and k mobile 
servers that can move within M. Points of M request servicer one after anotherFand the 
algorithm must move a server to the location of each request before seeing the next one. The 
goal is to minimize the total distance traveled by the servers. (See Chapter 2 for background 
on the A;-server problem.) 

We say that a real-time server algorithm is one which uses only a constant amount of spacer 
and constant time per request. This notion has been studied especially with respect to the 2- 
server problemrby Irani and Rubinfeld [IR] and Chrobak and Farmore [CF2]. In this setting it 
has been observed thatTwhile there are computationally expensive 2-server algorithms achieving 
the optimal competitive ratio of 2 [MMSrCFirCF4]rthe best known competitive ratio for a 
real-time algorithm is 4 [CF2]. 

In Chapter 5Fwe give a real-time 2-server algorithm which is 2-competitive in ra-dimensional 
space under the L\ ("Manhattan") metric. Fhis considerably extends the class of metric spaces 
known to have optimal real-time algorithms; the technique we use can be seen as a higher- 
dimensional generalization of the elegant "Double- Coverage" algorithm discovered for k servers 
on a line by ChrobakrKarloffFPaynerand Vishwanathan [CKPV]. We also prove a lower bound 
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for 2-server balancing algorithmsra sub-class of the real-time algorithms. Balancing algorithms 
have been proposed for a number of special cases of the A;-server problem; they are so named 
because they seek to "balance" the distance traveled evenly among the servers. We show that 
no such algorithm can achieve a competitive ratio better than (5 + v7)/2 (~ 3.82) for the 
2-server problem; this shows that no optimal on-line 2-server algorithm can be expressed as a 
balancing algorithm. This also representsrto our knowledgeFone of the first lower bounds for 
an on-line algorithm based solely on its computational resources. 

Some Notation 

We use standard order-of-growth notation (OTfir©) throughout; additionally we say that f(n) 

is o(g(nj) if 

lim ^H = 
n^oo g(n) 

and f(n) is u(g(n)) if g is o(f). 

Two different metrics in R d will be used at various places: L\ and L 2 . In the L\ ("Man- 
hattan") metricrthe distance between points x = (x 1 ,...,x d ) and y = (y 1 ,...,y d ) is given 

by 

d 

Li(x,y) = J2\ xi -v'l- 

8 = 1 

The L-) metric is the standard Euclidean distance; that \sT 



d 



\ 1/2 

L 2 (x,y)= [J2(x l -y l f) 



=i 
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Chapter 2 



On-Line Algorithms: An Overview 



The study of on-line algorithms stems from the issue of how an algorithm receives its input. In 
the traditional viewFail input is present immediately: for examplerif the algorithm is supposed 
to be computing the shortest path between two nodes in a graphrthen one assumes that it has 
access to entire graph from the start. This is clearly a reasonable assumption in a large number 
of situations; howeverrthere are equally natural situations in which it makes sense to imagine 
that the algorithm is receiving its input in an on-line fashion: the input arrives in a sequence 
of small piecesrand with the arrival of each piecerthe algorithm must make certain immediate 
and irrevocable decisions. 

In this senserthere is nothing new about on-line algorithms. Certainlyrthe difference be- 
tween "one-pass" and "two-pass" algorithms has always been a fundamental issue in designing 
computer programs; and this is essentially the same distinction expressed in different termi- 
nology. In other casesrthe circumstances of the problem force us to work within the on-line 
framework — for exampleFpaging algorithms evict pages from memoryrcall-control algorithms 
route connections through a networkrand trading strategies tell investors to buy and sehTail of 
them without knowing what the future sequence of "input" will look like. For such problemsr 
there are only on-line algorithms. 

At the same timer howeverrthere has been a tremendous increase in the number of "on- 
line algorithms" papers over the last ten years; this has derived mainly from the increasing 
popularity of competitive analysis as a perspective from which to discuss the performance of 
such algorithms. In the following sectionr we discuss competitive analysis in generair and 
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specifically the paper of Sleator and Tarjan [ST]. 

2.1 Competitive Analysis 

Typicallyran on-line algorithm is trying to solve some optimization problem (as in the examples 
cited above). Thusrif A is an on-line algorithmrand a is an input sequencer then the crucial 
parameter which must be minimized in most on-line problems is A(cr)Tt}ie cost incurred by A 
on a. 

For any given problemFwe can consider the optimal off-line algorithm OPTFwhich sees the 
entire input ahead of time and thus always achieves the minimum possible cost on a. In the 
traditional analysis of algorithmsrthe fundamental question is how computationally "expensive" 
(in terms of time or space complexity) it is to design the algorithm OPT . In the competitive 
analysis of on-line algorithmsFone instead takes OPT as a given and asks how close an on-line 
algorithm A can come to achieving the performance of OPT . 

Fhus we say that an on-line algorithm A is c-competitive if for some absolute constant (3T 
the expression 

A(a)-c-OPT(a) 

is bounded by (3 for all input sequences a. We will sometimes say that A is "competitive" if it 
is c-competitive for some constant c; the infimum of the set of c for which A is c-competitive is 
its competitive ratio. 

From a historical point of viewrit is quite difficult to identify the first use of this performance 
measure as applied to on-line algorithms. It appears in the bin-packing literature of the 1970'sr 
which considered the well-known problem of determining the fewest number of "binsF each 
of capacity lTwhich are required to contain a set of n objects with weights Wi, . . . , w n < 1. 
Fhis problem is NP-Completerbut a number of very natural heuristicsFseveral of them on-line 
algorithmsFgive constant-factor approximations to the minimum number of bins required. 

In this settingr an on-line algorithm is simply one which decides where to place the i th 
object before seeing the values of w J+1 , . . . , w n . One such example is First FitT which puts the 
object in the first bin in which it will fitTstarting a new bin only if necessary. GareyFGrahamr 
and Ullman [GGU] proved that the competitive ratio of First Fit is y|rand an almost exact 
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characterization of the performance of First FitTas well as a number of other approximation 
algorithmsFwas given by Johnson in his Ph.D. thesis [J]. In a result very much in the spirit 
of current work in on-line algorithmsFYao proved in 1980 that there is no on-line bin-packing 
algorithm with a performance ratio better than | [Yao]. 

The competitive ratio also appears sporadically (and again not cast in the current terminol- 
ogy) in work on graph- coloring. The First Fit algorithm has a natural meaning in this world 
as well: proceed through the vertices one at a timer coloring each with the lowest-numbered 
color possible. While First Fit performs abysmally on some classes of graphsrit has long been 
observed to have relatively good behavior on others. The survey paper of Kierstead and Trotter 
[KT] mentions a number of problems related to the First Fit coloring algorithm; for exampler 
Woodall asked in 1974 whether the number of colors used by First Fit was within a constant 
factor of x(G) when G is an interval graph (see also [Kier]). In the new languagerthis is simply 
the question of whether First Fit achieves a constant competitive ratio for this type of graph. 
In the last five yearsrthere have been a number of papers on "on-line graph coloring" in the 
competitive framework; these will be discussed briefly in Section 2.3. 

But it was the work of Sleator and Tarjan in 1985 [ST] which launched the current deluge 
of papers. [ST] considered two classical problems of computer sciencer within the setting of 
competitive analysis: the maintenance of a dynamic data structureFand paging. 

On the first of these questionsr [ST] dealt specifically with the problem of maintaining a 
linked list under a sequence of insertT deleteTand member? requests. This problem had a long 
historyTand many heuristics had been proposed. In the context of on-line algorithmsFas in the 
actual applicationsFone is trying to minimize the total number of memory references over the 
course of a request sequence (i.e. one always wants the current item referenced to be as close 
to the front of the list as possible). What Sleator and Tarjan showed was that the well-known 
heuristic MOVE-TO-FRONT (always put the item just accessed at the front of the list) is 2- 
competitive. One thing that makes this result particularly striking is that the competitive ratio 
of such a simple rule should be so lowFespecially when there is in fact no good characterization 
known for the optimal off-line algorithm. The proof was also interesting for its use of a potential 
function argumentTa technique that was to become extremely common in subsequent research 
in on-line algorithms (we will give an example of such an argument in Section 2.2). 
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The results in [ST] concerned with paging gave much stronger lower bounds. Here one 
pictures a computer with k pages of fast memory and an unlimited number of additional pages 
of slow memory (e.g. a disk). When a reference is made to a page not in fast memoryra "page 
fault" occurs: the referenced page must be moved inrand some page currently sitting in fast 
memory must be swapped out. The goal is to minimize the number of page faultsFover the 
course of a string a of memory references. In [ST] it was proved that no on-line paging algorithm 
with k pages can be better than £;-competitiveFand a number of well-known algorithms such 
as LRU (evict the least-recently used page) and FIFO (evict the page that was brought in 
longest ago) match this bound. Of courserfrom the point of view of operating system designr 
a performance guarantee of k is ridiculously large — a rule such as LRU actually performs 
extremely well in practice — but the result does lend some insight into why looking at previous 
requests should help in designing a paging algorithm. Various extensions to the model of [ST] 
have been proposed [BIRSTKPRjrbut the question of why on-line paging works so well in real 
life remains an intriguing one. 

Following [ST]Tthe paper [KMRS] analyzed additional on-line strategies for cache manage- 
ment (which is essentially the same as paging in this model)rand [BLS] proposed "metrical task 
systems" as an abstract model for studying on-line algorithms. The following yearFManasser 
McGeochrand Sleator [MMS] introduced what has become perhaps the most well-known and 
well-studied on-line problem: the A;-server problem. 

2.2 The A>Server Problem 

We imagine the following situation. An on-line algorithm A controls k mobile servers which are 
free to move around in some metric space M. A finite request sequence a is now presented to 
the algorithmFone request at a time. Each request is a point in the space M; the algorithm must 
move one of the servers to this point before seeing the next request. The goal is to minimize the 
total distance traveled by the serversFover the entire request sequence. The A;-server problem 
generalizes paging in the following sense: if M is the set of possible pages of memoryFwith 
the uniform metric (all non-trivial distances are l)Tand the servers are the k "slots" of fast 
memoryrthen the problem becomes that of minimizing the number of page faults. 

In their original paperFManasseFMcGeochrand Sleator showed that for every metric space 
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M and every on-line algorithm ylLthere are arbitrarily long request sequences on which 

A(a) > k-OPT(a)-0(l). 

That isTno on-line algorithm can be better than ^-competitive in any metric space. Note that 
LRU or FIFO provides a matching upper bound for the uniform metric space; [MMS] also 
provided a 2-competitive 2-server algorithm and a ^-competitive A;-server algorithm for any 
metric space with only k + 1 points. Based on these special casesrthey advanced the k-server 
conjecture: for every metric space MTthere is a ^-competitive A;-server algorithm. At the time 
of this writingr the conjecture is still open; howeverr significant progress has recently been 
achieved in [KP]. 

One of the tantalizing aspects of this problem is that most natural algorithms — for exampleL 
the greedy algorithm (move the closest server) as well as most straightforward generalizations 
of LRU — do not achieve any bounded competitive ratio in general. IndeedLit was not known 
initially whether one could even achieve a constant competitive ratio (depending only on k) 
in an arbitrary metric spaceLeven for the case of three servers. This was settled by FiatL 
RabaniLand Ravid [FRR]Lwho gave a general algorithm with a competitive ratio of at most 
2°( kl °g k ). Grove improved this bound to 2°^ [Gr] using a simple randomized algorithmLand 
derandomization techniques of Ben-David et. al. [BBKTW]. And very recentlyLKoutsoupias 
and Papadimitriou [KP] have shown that the "work-function algorithm" proposed by Chrobak 
and Larmore [CL4]Las by well as other researchersLis at most (2k — l)-competitive. 

In a separate directionLa number of papers extended the set of metric spaces for which good 
server algorithms were known. ChrobakLKarloffLPayneLand Vishwanathan [CKPV] gave an 
elegant ^-competitive algorithm for k servers on the line (i.e. M = R 1 ). The algorithm works 
as follows: let [a, b] denote the interval on the line whose left endpoint is the leftmost server 
and whose right endpoint is the rightmost server. If the request falls outside this intervalLmove 
only the closest server; otherwiseLmove the two "neighboring" servers at the same speed toward 
the request until the closer one reaches it. SubsequentlyL Chrobak and Larmore showed that a 
natural generalization of this algorithm for the case in which M is a tree is also ^-competitive 
[CL3]. 

The proof that this algorithm is ^-competitive makes use of a shortLif somewhat mysteriousL 
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potential function argument. We describe the proof for the case of two servers on a lineFsince 
it provides a good example of how such a proof proceeds. 

A potential function $ is a function defined on the set of possible configurations of the 
on-line and off-line servers. ThusFwhen any server movesrthe value of $ may change. We 
denote the i th request in the request sequence a by <7 8 Tand the cost incurred by A and OPT 
on this single request by A((Ji) and OPT(ai) respectively. Consider the following sequence of 
events: for each i = 1,2, . . Ffirst OPT serves the request <7 8 Tand then A serves <7j. We will 
watch how the value of $ changes over the course of these movements of the servers. Suppose 
we can show the following three facts: 

1. $ is always non-negative. 

2. There is a constant c such that whenever OPT serves a request <7 8 T<I> goes up by at most 
c-OPT(<Ti). 

3. When A serves a request <7 8 T<I> decreases by at least A((Ji). 

Note that over the course of the whole request sequencer the total of the increases in $ is 
at most cJ2i OPT(ai) = c ■ OPT(a)T&nd similarly the sum of the decreases in $ is at least 
J2i^( (J i) = A{cf). Thusrif we let <j> denote the value of $ on the initial configuration of the 
serversrthen the fact that $ is always non-negative implies 

A{a) < c-OPT(a) + <S> . 

That isTA is c-competitive. 

It is not difficult to show (see [MMS]) that we can assume OPT is a lazy algorithm: it 
always moves at most one server on each requestTand does not move if it already covers the 
requested point. If x and y are points on the linelTet xy denote the distance between them. For 
the case of two servers in M = R 1 Tlet s l7 s 2 denote the positions of algorithm ,4's servers!^, o 2 
the positions of OPT's serversFand M min the minimum of SiOi + s 2 o 2 and Si0 2 + s 2 Oi (i.e. it is 
the minimum-cost matching between the servers of A and OPT.) The potential function used 
in [CKPV] is an adaptation of one used by CoppersmithrDoyleFRaghavanFand Snir [CDRS]; 
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for our purposesrwe can write it as 

$ = 2M min + s lS2 . 

Clearly $ is always non-negative. When OPT moves a distance dTit cannot affect the second 
termrand it increases the first term by at most 2d. On a request in which A moves only one 
serverrsay a distance cTthe first term will go down by 2d and the second will go up by d. On a 
request in which A moves both servers a distance rfTthe first term cannot increaseFwhire the 
second goes down by 2d'. (Verifying each of these facts is a fairly straightforward exercise.) As 
this shows that $ has the required three propertiesrthe proof is complete. 

In this thesisrwe will only be considering 2-server algorithms. Note that in this settingr 
there is no "A;-server conjecture" — it has been known since the origin of the problem that 
there is a 2-competitiveron-line algorithm for two servers in a general metric spaceFand that 
no on-line algorithm can achieve a better ratio. (Other 2-competitive 2-server algorithmsrbased 
on quite different techniquesrhave been given by Chrobak and Larmore in [CL]TCL4].) Thus 
the emphasis here has been to give competitive algorithms that are as "simple" as possible — 
simple both in the effort required to analyze themFand in the amount of computation they 
must perform on each request. 

Specificallyr all known optimal 2-server algorithms use more than a constant amount of 
time and space per request; indeedrwhen deciding which server to send to a point in MTthey 
must perform computations based on the entire set of points constituting the previous requests. 
Given the nature of the problemrit might not have been unreasonable to have conjectured that 
this much computation is required to achieve any constant ratio. This is not the caserhowever. 
Irani and Rubinfeld [IR] gave the first competitive 2-server algorithm which performs only a 
constant amount of work on each request; its competitive ratio is known to be somewhere 
between 6 and 10. A "fast" 2-server algorithm with a competitive ratio of 4 was subsequently 
given by Chrobak and Farmore [CF2]. This issue of the computational resources required by 
server algorithms will be the subject of Chapter 5. 
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2.3 Other On-Line Problems 

We now give a brief description of some other areas which have been studied from the perspective 
of competitive analysisrbefore turning to robot navigationrthe other main area considered in 
this thesis. By nowFmany on-line problems have been studied; the following is not meant to 
be comprehensive. 

As mentioned aboverthe surge of activity following [ST] led to some new results in on-line 
graph coloringr already a topic of interest in the 1970's. One basic question was the following: 
is there an on-line graph coloring algorithm with competitive ratio o(n)1 Neither First Fit nor 
any of the other standard on-line coloring algorithms are better than 0(ra)-competitiveFwhich 
is unfortunately also (to within constant factors) the competitive ratio of the algorithm which 
never re-uses a color. This question was answered in the affirmative in 1989 by FovaszFSaksr 
and Trotter [FST]Twho gave an on-line algorithm with performance ratio ( { " ) . In terms 
of lower boundsrHalldorsson and Szegedy [HS] showed that for every on-line algorithm A and 
integer ATthere is a graph with at most k(2 k — 1) vertices and chromatic number k on which A 
will use 2 k — 1 colors. ThusFno on-line algorithm can be better than 0( " )-competitive on all 
graphs. Related work on on-line graph coloring has been done by Irani [Ir] and Vishwanathan 
[Vi]. 

The problem of virtual circuit routing through a network is naturally on-lineFand has been 
studied from that point of view in a number of recent papers. The model here is the following: 
one has a graph G which represents a high-speed networkrand users at the nodes of the network 
who wish to transmit large amounts of data to each other. At each stepFa pair of users requests 
that a connection be established between them. There are many variationsrbut in general a 
connection is specified by its two endpoints s and ^rthe amount of bandwidth requestedrand 
the amount of time for which the connection must be in place. It is then the job of the on-line 
algorithm A to decide on a path from s to t on which the connection will be routed. Typicallyr 
A is trying to minimize the congestion in the network; thusrthe cost incurred by A could be 
the maximum amount of bandwidth that passes through any link of G at any point in time. 

The above description is essentially of the model considered in [AAFPW]Tand in [AAPW] 
when the duration of the connection is unknown but re-routing is allowed. An alternativerbut 
similarFscenario is the following: the network does not have sufficient bandwidth to handle all 
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requests; thus the on-line algorithm A must first decide whether to accept the connectionFand 
if sorto route it through the network. This is the set-up considered in [AAPrABFR]; a closely 
related problem is that of scheduling "intervals" on a line [LT]. One recurring theme in this 
work is that randomized algorithms often can achieve competitive ratios that are exponentially 
better than the lower bound for deterministic algorithms. 

A somewhat related problem was proposed by Imase and Waxman [IW]Tthough it has been 
suggested independently in a number of sources. This is the on-line Steiner tree problem: one 
has a metric space MTand n points in M which must be joined together by a minimum-cost 
Steiner tree. The sequence is presented on-liner and each new point must be immediately 
connected to the existing tree. The on-line/off-line distinction here sort of resembles an "urban 
growth" phenomenon — after the fact r it is much easier to find a reasonably short spanning 
network than when the points are appearing one at a time and must be hooked up immediately. 
[IW] shows that the greedy algorithm (when point x is requestedr connect it to the existing 
network via the shortest possible path) is O(logra)-competitive in any metric space; they also 
present a somewhat contrived metric space M in which no on-line algorithm can be better than 
O(log ra)-competitive. Alon and Azar [AA] subsequently showed that in the Euclidean planer 
no on-line Steiner tree algorithm can be better than 0( l( ^°f" )-competitiverieaving a slight gap 
between the upper and lower bounds in this case. 

2.4 Robot Navigation 

Maze-solving is an old obsession. The interest in such problems can be seen in the labyrinths of 
mythology and medieval architecture; andrthe bridges of Konigsberg notwithstandingrit was 
one of the original motivations for study of graph theory. Maze-solving algorithms phrased in 
the terminology of graphs can be found in work of Tarry and Tremeaux that reaches back a 
century and more [Ore]. More recentlyFsuch questions have been addressed in the contexts of 
automata theory by Blum and Kozen [BK] and robotics by Lumelsky and Stepanov [LuS]. 

On-line navigation is concerned with questions somewhat more general than maze-solving: 
the environment will not always be nefariousrthe goal not always to escape. Rather rwe will be 
interested in a variety of tasks which must be performed in an environment for which the mapr 
or some other crucial piece of informationris not known. Although the focus is theoreticair 
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the motivation comes from the ever-growing field of autonomous mobile robotsPand we should 
frequently ask whether the problems we consider have some bearing on the real-world issues of 
robotics. 

Navigation in an unknown environment does not fit precisely into the framework described 
aboverin that the mobile robot has a more interactive relationship with the "input" than in 
previous problems. Neverthelessrthe problem is naturally on-linerin that the environment is 
presented in a piecemeal fashionPas the robot sees new parts of itTand the goal is (in general) 
to minimize the total distance traveled. The recent incorporation of navigation problems into 
the setting of competitive analysis comes mainly from the work of Baeza-YatesPCulbersonPand 
Rawlins [BCR]Tand Papadimitriou and Yannakakis [PY]. 

[BCR] does not speak directly in terms of the competitive ratiorbut its focus on the ratio of 
the robot's distance traveled to the length of the shortest path is clear enough. The questions it 
considers are variations on the theme of searching for an object at an unknown location in the 
plane or on a line. These includerfor examplerthe problem of searching for a point on a line 
or a collection of linesPand the well-known "lost-at-sea" problemrin which one must determine 
the optimal search pattern for finding a line (the shore) in the plane (the ocean) [Beris]. 

A fundamental technique introduced in [BCR] is that of spiral search — an uncomplicated 
idea which has proved to be a valuable building block in numerous on-line algorithms. A good 
example of its use is in the problem of a robot searching for a goal which lies on one of n roads 
meeting at the origin. We will assume here that the robot is constrained to move on these roads 
and cannot use vision; howeverrit will recognize the goal when it comes to it. Moreoverrto 
prevent certain pathologiesPwe will assume that the goal is at least one unit of distance away 
from the origin. 

Clearly no on-line algorithm can be better than S7(n)-competitive for this problem: to 
construct a lower boundrwe simulate the robot on a set of roads which does not contain a goair 
and number the roads r l7 . . . , r n according to when the robot first travels one unit of distance 
on each. We then simply place the goal one unit from the origin on road r n . The robot will 
travel a distance of 2ra — IT whereas the shortest path to the goal is 1. A simple form of the 
spiral search solution to this problem goes as follows: for d = 1, 2,4, . . ., 2 J , . . Xexplore each 
road out to a distance d and return to the origin. If the goal is found when d = lTthe robot has 
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traveled at most 2ra — 1 times the length of the shortest path. OtherwiseFsuppose the goal was 
found when d = 2 J for some j ' > 1. Then the total distance traveled by the robot is at most 

i 
J2n-2-2 j < 2 j+2 n. 

8 = 

Meanwhiler since the goal was not found when d was equal to 2 J_1 rthe length of the shortest 
path is at least 2 J_1 . Thusrthe robot travels at most 8ra times the length of the shortest path. 
In [BCR] it is shown that by modifying these parameters a littler one achieve a competitive 
ratio of 2era — o(ra)Tand that this is optimal up to low-order terms. 

As is the case with most of the basic notions discussed hereFone can find the spiral search 
technique being used implicitly in a wide range of previous papers; its wide applicability is 
obviously due to its simplicity — keep doubling until you succeed — and the relative ease of 
analyzing the performance guarantee one gets — the sum of all that you've done in previous 
phases is only a constant fraction of what you do in the current phase. Recentlyrit has been 
used in many of the robot search papers we will discuss below [BCRrPYrBRaScFKRTrKl]r 
an abstract kind of navigation problem known as layered graph traversal [PYrFFKRRV]rthe 
design of hybrid algorithms [KMSY]Tand even the approximation of some NP-hard problems 
[BCCPRSrTWSY]. 



i i : n t 



Figure 2-1: A rectangle packing 

The work of Papadimitriou and Yannakakis followed [BCR] and deals with navigation prob- 
lems that are somewhat less stylized. Specificallyr they consider a robot with vision moving 
around in a plane filled with rectangular obstacles. The robot can move through any part of the 
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plane not filled in by an obstacleFand there is always room to move between adjacent obstacles 
(i.e. they cannot be "stuck together" to make more complex shapes). By visionTboth here and 
in what follows r we mean the following: if the robot is at xTit can see the point y if the line 
segment ~xy does not meet any obstacle. Finallyriet d(s,t) denote the length of the shortest 
obstacle-avoiding path between points s and t. 

Based on the above definitionsFwe will say that a rectangle packing is a collection of axis- 
aligned rectanglesFeach of which has at least unit thicknessr such that there is always just 
enough room for the robot to pass between neighboring rectangles. [PY] deals with shortest- 
path problems in this setting: the robot starts at a point s and knows the coordinates of a point 
t that it wants to reach. Fhe point t is assumed to be in free space — i.e not in the middle of 
an obstacle. Fet n denote the straight-line distance from s to t; note that this could be much 
less than d(s,t). When all the obstacles in the packing are squaresr[PY] gives an algorithm 
with competitive ratio |v26 ~ 1.70Fand shows that no on-line algorithm can be better than 
|-competitive. For the case of an arbitrary rectangle packingFno bounded competitive ratio is 
possible: [PY] shows a lower bound of £l(^/n) on the best possible competitive ratio. 

BlumrRaghavanFand Schieber [BRaSc] address the same type of shortest-paths problems 
and give an O(y / ra)-competitive algorithm for the s-t path problem in a rectangle packing; this 
matches the lower bound of [PY] up to constant factors. [BRaSc] also introduced the elegant 
"room problem" — consider a rectangle packing inside a large 2ra X 2ra square room (so there 
is space to move along the walls as well)rsuch that the center of the room is in free space. Fhe 
robot wants to start at one corner s of the room and reach the center t. Fhe intriguing point 
here is that there is always a path from s to t of length 2ra — simply start at tTmove west as 
far as possiblerthen south as far as possiblerthen west . . . and so on — and this construction 
is of course on-line. But can the robot starting at s somehow reverse this process and find a 
path to t of length 0{n)1 [BRaSc] left this as an open questionFproviding only an algorithm 
to generate a path of length 0(n2V logn ); the question was answered a year later by Bar-EhT 
BermanrFiatrand Yan [BBFYjTwho showed by a very complicated argument that no on-line 
algorithm can be guaranteed to find a path shorter than S7(nlog ra)Tand gave an algorithm with 
a performance guarantee matching this up to constant factors. 

Fess work has been done on geometric problems more in the spirit of [BCR] — that isrtrying 
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to find a short path to a goal t when neither the map of the environment nor the location of t 
is known. This will be the subject of Chapter 3. 

Finallyr there have been several papers on the related problem of exploring an unknown 
environment. Hererthe goal is to traverse a path that sees all parts of the environmentTboth 
the obstacles and the free space. For the problem of exploring the interior (or exterior) of 
a simple rectilinear polygonFDengFKamedaFand Papadimitriou [DKP] give a 2-competitive 
algorithm. The off-line version of this problem (what is the shortest path for exploring a given 
polygon PI) is perhaps better motivated as the "Shortest Watchman's Route Problem"; Chin 
and Ntafos [CN] give a polynomial-time algorithm for computing such a router and some of 
their definitions are used in the algorithm of [DKP]. BetkeFRivestTand Singh [BRiSi] consider 
compact exploration algorithms: the robot always knows the shortest path back to the origin 
whenever it reaches a new point. Their main result is a compact exploration algorithm for 
rectangle packingsr which travels at most a constant times the total perimeter length of all 
rectangles in the scene (this is a lower bound for the optimal solution ifT for exampler we 
assume that the robot has very limited vision). 

For our purposesFexploration algorithms (on suitably restricted parts of the environment) 
will prove useful as subroutines in Chapters 3 and 4. 
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Chapter 3 



Searching an Unknown Polygon 



We are interested in geometric variants of the problem of searching with uncertainty. A natural 
problem in robotics is that of searching for a goal in an unknown polygonal region. For exampler 
a robot with vision is placed at a starting point in a simple polygonFand it must traverse a 
path to some target point in the polygon. Both the location of the target and and the geometry 
of the polygon are unknownrbut the robot will recognize the target when it sees it. 

As discussed in the previous chapterFon-line search algorithms have generally been devel- 
oped for situations in which the geometry is kept to a minimum — for exampler the case of 
searching for a point on one of m concurrent rays [BCRrKRTrKMSY]ra line in the planer 
or a point in an integer lattice [BCR]. Moreoverrthe structure of the space to be searched is 
assumed to be known — only the location of the target is unknown. [BCR] writesr". . .these 
problems are (very simple) models of searching in the real- world. It is very often the case that 
we do not know many of the parameters that are usually taken for granted in designing search 
algorithms." 

HereFwe provide an on-line algorithm for the general problem of searching for a target 
point at an unknown location in an unknown and arbitrary simple rectilinear polygon. Fhusr 
the robot must adapt its search pattern as it sees more and more of the polygon. For a given 
rectilinear polygon PFwe identify the number of essential cuts of P [CNrDKP] as a fundamental 
parameter in determining the best competitive ratio attainable in searching P — it is easy to 
cast the problem of searching m concurrent rays as a search problem in a polygon with m 
essential cutsFwhence the [BCR] lower bound of 2em — o(m) on competitive ratio applies (e 
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is being used to denote the base of the natural logarithm here). Thusrthe natural question 
is whether there is an algorithm which is O(m)-competitive for the problem of searching an 
arbitrary simple rectilinear polygon with m essential cuts. In Section 3.3rwe present such an 
algorithm. 

The algorithm is based on the problem — first considered in [DKP] — of exploring a simple 
rectilinear polygon Prstarting from and returning to a fixed point s in P; we use an exploration 
algorithm iteratively as the search proceeds. We show in Section 3.2 how an adaptation of the 
technique in [DKP] gives a randomized algorithm which is 5/4-competitive in the L\ norm when 
s is any point inside P. 

Another direction in which one could investigate such search problems is to restrict the 
class of polygons and target points in such a way that a constant competitive ratio can be 
achieved for the search problem. Such an approach has been adopted by Klein in his work on 
streets. The papers [IKTK] introduce the term street to define a class of general (not necessarily 
rectilinear) polygons with two distinguished points s and ^rsuch that the two st boundary 
chains are mutually weakly visible (see below for an elaboration on this definition). In [K]T 
Klein gives a 1 + |7r-competitive (~ 5.71-competitive) algorithm for finding t from s in an 
unknown street P. Howeverrthe algorithm and its analysis are quite involved. In Section 3.1 



we give a simple algorithm with a competitive ratio of at most y 4 + y8 (~ 2.61). Moreover 
when P is rectilinearrit achieves the optimal ratio of y2- We believe it would be interesting in 
general to find other natural classes of polygons that can be searched competitively. 

FinallyTa word about the distance metrics used in this chapter. The distance between two 
points as measured in the L x and L 2 (Euclidean) metrics differs at most by a factor of V2; 
thusran algorithm which is c-competitive in L x is cv2-competitive in L 2 . With the exception 
of the algorithm for traversing streetsFwhich is analyzed directly in the Euclidean metricFwe 
present our results in the conceptually neater framework of the L\ metric. In view of the tight 
correspondence between L\ and Z^Tour final search algorithm is O(m)-competitive in both. 

3.1 Traversing an Unknown Street 

Let P be a simple polygon and s and t two distinguished points on the boundary. The removal 
of s and t would disconnect the boundary into two polygonal chainsrZ and R. We say that 
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P is a street [IKTK] if each point on the boundary of P can see some point on the opposite 
boundary chain. The goal is for a robot with vision to travel from s to t; neither the map of P 
nor the coordinates of t are known. The cost incurred by the robot is the length of the path it 
generatesrand its competitive ratio is taken with respect to the length of the shortest s-t path 
in P; distances are measured in the Euclidean metric. 



b) 
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Figure 3-1: Streets and their views 



Figure 3-l(a) can be completed to form a rectilinear street in which t could be just around 
the corner from either X or Y [K]. The robot will incur the best worst-case performance if it 



moves directly to segment XlTthen to t (it will see t when it reaches XY). This can be at most 
a factor of y2 longer than the shortest path. Curiouslyrthis is the only known lower bound on 
the competitive ratio achievable for the problem. In [K]Tan algorithm with a competitive ratio 
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of at most 1 + |7r (~ 5.71) is presented. BelowFwe give an algorithm with competitive ratio at 
most ^4 + ^8 (~ 2.61). 

The example of Figure 3-l(a) is central to the proof technique we develop in this sectionr 
and it highlights a principle that will appear repeatedly in what follows — on-line algorithms 
hate making decisions. Specificallyrit is useful in many navigation problems to adopt a strategy 
that preserves the robot's options for as long as possible. In the figurer suppose the robot at 



point s is moving towards segment XlTbut it has not yet decided whether it ultimately wants 
to visit point X or Y . Let us define a polygonal path to be monotone if the x- and y-coordinates 
of the points on the path change from their initial to final values monotonically. Then the key 
observation is that in the L\ metricFany monotone path between two points is a shortest pathr 



so the robot can defer its decision (X or Y) until it reaches segment XY and still have the 
option of traveling optimally to either point. The related fact for the Euclidean metric is that 
any monotone path between two points in the plane has length at most a factor of y2 times 
the straight-line distance between them. Thusrin this examplerthe robot can move to segment 
XY before making a decision and travel only y/2 times too far (Euclidean distance) in the worst 
case. 

Let P be a streetTand assume that the robot is currently located at a point x inside P. The 
robot maintains an extended view of P; this consists of all points on the boundary of P that it 
has seen so far. The robot's extended view will typically look like the example of Figure 3-l(b). 
We define a cave C to be a connected chain of the boundary of P such that the robot has seen 
the endpoints of the chain but no other points of it. At some point p' on the robot's pathrthese 
two endpoints were on the same line of sight from p'; call the one closer to p' the "mouth" of 
C . Each cavemouth v is a reflex vertex of P; in the neighborhood of vTP lies either to the left 
or right of the ray p'v. We accordingly refer to v as being either a left or right cavemouth. 

At any given point in timer it is useful to picture the "forward" direction for the robot 
as being along the positive y-axis; so the negative s-axis lies immediately to its left and the 
positive s-axis immediately to its right. In keeping with this terminologyrif u and u' are two 
vectorsFeach with non-negative y coordinateFwe will say that u is to the left of u' if it forms a 
smaller angle with the negative s-axis. Assume that t has not yet been seenFand the robot has 
maintained the invariant that the points in its extended view immediately to its left and right 
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belong to L and R respectively. In view of these assumptionsFwe assemble some facts about 
extended views of a street before presenting the algorithm itself. The first is a standard fact 
about shortest paths inside any simple polygon. 

Lemma 3.1 If t is contained in a cave C, then the shortest x-t path touches the mouth of C . 

Lemma 3.2 Let p be a point on boundary chain L (resp. R), and let $ be the boundary chain 
sp of P contained in L (R). If the robot moves from s to p, it will have seen every point on $. 



Proof. Every point on boundary chain $ must be able to see some point on R; but all such 
lines of sight to R cross the robot's path from s to p. Thus the robot has seen every point on 
'!'. I 

Lemma 3.3 If v is a left (right) cavemouth, it belongs to boundary chain L (R). 

Proof. Assume v is a left cavemouthrf £ iiTand v was seen from point p' . The chain 
determined by a clockwise scan of the boundary from v to s (taken from point p') is entirely 
contained in R. Thusrif the robot were to walk directly from p' to vTit would have seen all 
of this chainrby Lemma 3.2. But since v is a cavemouthrit would not have seen any point 
on the boundary of P just around the corner from vT which belongs to this chain; this is a 
contradiction. I 

Corollary 3.4 In the extended view, all left cavemouths lie to the left of all right cavemouths. 



If the extended view contains any left cavemouthsFwe define c; to be the rightmost one. The 
point c r is defined analogously for right cavemouths. If both c; and c r are definedrthe chain 
between the far endpoints of their respective caves must be completely visible in the extended 
view; otherwiserit would contain an additional left or right cavemouth. Combining this with 
Lemma 3.1 and the fact that t has not been seenL 

Lemma 3.5 The point t lies in the cave of either C\ or c r . Consequently, the shortest path 
from x to t touches either C\ or c r . 
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Let d(-, •) denote the length of the shortest path between two points in P. The shortest 
path from s to ^denoted by TOs a chain of line segments joined at reflex vertices of P. There 
is a natural order on the vertices of ITdetermined by traversing it from s to t. Our algorithm 
works iterativelyr allowing the robot to move from a given vertex of T to a later oneFwith small 
"detour." The following theorem provides the main inductive step. 

Theorem 3.6 Assume that the robot is currently located at a vertex x £ T. Then it can move 
to a later vertex i'gT while traveling at most (y4 + y/8)d(x,x'). 

Proof. We present an (on-line) algorithm for doing this. Based on the robot's extended viewr 
there are four cases to consider. (See Figure 3-2.) 




Case 2 





Case 3 



Case 4 



Figure 3-2: The algorithm at work 

Case 1. If t is visiblerthe robot moves directly to t. The distance traveled is d(x,t). 

Case 2. If c r (c;) is not defined (there are no right (left) cavemouths)rthen by Lemma 3.5r 
r passes directly through c;. Thusrthe robot moves directly to cTfollowing T the whole way. 

Otherwiserboth c; and c r are visible. The robot chooses a direction of motion such that 
C; lies to its left and c r lies to its right. We view this as a coordinate system in which the 
robot is the origin and it is moving in the direction of the positive y-axis; thusFc; has negative 
^-coordinate and c r has positive ^-coordinate. The robot moves in this directionFupdating its 
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extended view and the points C/,c r Luntil one of of the above two cases appliesLor one of the 
following two: 

Case 3. The point c; (or respectively c r ) "jumps" to the opposite side of the y-axis. At the 
moment when this happensrboth c; and c r will lie on the same line of sight. If the robot moves 
in this direction until it hits the nearer onerit will once again be on a point x' G IT having 
followed a path from x to x' that was monotone with respect to the chosen coordinate system. 
Thusrit has traveled no more than y2 times the distance from x to x' along L. 

Case 4. If none of Cases lL2Lor 3 appliesLthen there comes a point at which the robot's 
line of sight to c; (c r ) is parallel to the s-axis. At the moment when this happensLthe robot 
is "confused"; it can no longer follow a path guaranteed to be monotone to either c; or c r . 
Howeverrthe robot can adopt the following approach to return to the shortest path. 

Let us translate the coordinate system so that the robot is again at the origin (so c; now 
lies on the negative s-axis). Since t has not yet been seenLand since the second quadrant (i.e. 
{x < 0, y > 0}) is free of cavemouthsrthe robot can see all of the contiguous boundary chain X 
lying in this quadrant. ThusLX must be entirely contained in L or R; the robot can return to 
r once it discovers which of these cases holds. Define L x to be the portion of boundary chain 
between c; and the endpoint of X lying on the negative s-axis; define R x to be the portion of 
boundary chain between c r and the endpoint of X lying on the positive y-axis. 

The robot begins moving in the direction of the vector ( — 1, l)Lupdating its extended view 
and the points C/Lc r Luntil it sees t or one of the following events occurs: 

1. C; has the same ^-coordinate as the robotLor c r has the same y-coordinate as the robot. 

2. One of the chains L x or R x becomes completely visible. 

If event (2) occurs firstTthen the robot will be able to move to the opposite cavemouthTthereby 
returning to T. Suppose event (1) occurs first; assume for the sake of concreteness that c; has 
the same ^-coordinate as the robot (the other case is strictly analogous). Then a point just 
around the corner from c; can only see points lying on X U L x ', this implies that X C iiTand 
so the robot can return to T by moving to c;. 

We must now bound how far the robot travels by implementing this strategy. Let r denote 
the distance traveled by the robot prior to becoming confused. First consider the case in which 
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event (1) occursLand c r has the same y-coordinate as the robot. Suppose that the coordinates 
of c r are (v, u); then the robot travels r + M(l + A/2) + fLwhire we have d(x, x') > yj{r + u) 2 + v 2 . 
Thusrthe worst-case ratio incurred by the robot is bounded by 

r + u(l + \/2) + v 



\/(r + u) 2 + v 2 
A somewhat lengthy but straightforward argument shows that this expression attains its max- 



imum when r = and u = v (1 + v2)Lwith a value of y 4 + V8- 

Now suppose event (1) occursrand c; has the same ^-coordinate as the robot. If the 
coordinates of c; are ( — u, — v )Tthen the robot travels r + u(l + y/2) + v. Assume that when the 
robot first became confusedrthe coordinates of c; were ( — M l7 0) (recall that this point lies on 
the negative s-axis). Let u 2 = u — Ui; then we have 



d(x, x') > 

In view of the simple inequality ^| < inax(^, -|)rthe worst-case ratio is upper-bounded by the 

maximum value of 

r + Ul (l + V2) 



As aboverthis is maximized by taking u x = r(l + v / 2)Twith a value of y 4 + y/8. 

The case in which event (2) occurs first is similar. Suppose that all of L x becomes visible 
and the goal has not been seen (the other case is analogous). Then we can set x' to be the 
current value of c r . Suppose that the coordinates of c r are (v,u); then since event (1) did not 
occur rthe robot's y-coordinate is no more than u. Thus it can get to c r having traveled at most 
r + u(l + V2) + v rwhile again d(x, x') > \/(r + u) 2 + v 2 . 

Finallyr we should note that it is possible for the robot's motion to be stopped by the 
boundary of P. As in Lemma 3.2LhoweverLit will have seen the entirety of one of the caves 
associated with c; or c r by the time this happensLso it can return to L; the preceding analysis 
is not affected. ■ 

Corollary 3.7 For any street P , repeatedly applying the above algorithm produces a path from 
s to t that is at most y 4 + V8 times as long as the shortest s-t path in P. 
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In the case in which P is rectilineaiTthe above algorithm can be implemented so that the 
robot's direction of motion is always along one of the coordinate axes. Then the key observation 
is that Case 4 cannot occur in a rectilinear street (since all angles are right anglesr either a 
point just around the corner from c; could not see iiTor a point just around the corner from c r 
could not see L). Thus we can show 

Theorem 3.8 When P is rectilinear, the above algorithm is \/2-competitive, and this is opti- 
mal. 

3.2 Exploring a Rectilinear Polygon 

FirstTwe present some basic definitions of [CNTDKP] on the structure of rectilinear polygons. 
In this and the remaining sectionsrdistances will be measured in the L\ metric. 
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Figure 3-3: Some simple rectilinear polygons 



Let P be a simple rectilinear polygon and s a distinguished point in P. An edge e of P is 
contained in a line I; we say that an extended edge is a line segment e C I in P which shares one 
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endpoint with erand whose other endpoint is also on the boundary of P. Each edge e induces 
at most two extended edges. AlsoFnote that an extended edge is either horizontal or verticair 
depending on the orientation of its associated edge e. Call e a horizon if there is no path from s 
to e that does not cross e. We can define a partial order on horizons as follows: if h and h! are 
horizonsrthen h < h' (h dominates h') if any path from s to h! must cross hYox if h = h! . The 
horizons of P which are maximal are called essential cuts; for an essential cut hT it is possible 
to start at s and follow a path which crosses every horizon except h. See Figure 3-3. 

A special case of the following lemma is given in [DKP]; it is the underlying reason for the 
success of "greedy" exploration algorithms in rectilinear polygons. 

Lemma 3.9 Let oti, . . .,a n be a set of horizontal and vertical segments in P, and v a point 
in P. The (L\) shortest path beginning at v and touching the a, in order is generated by the 
greedy algorithm, which, from segment a.j, always chooses the shortest path to cij+i- 

The problem we are considering here is that of traversing an exploration route in P: a 
pathr starting and ending at sFsuch that every point of P can be seen from some point on 
the path. The robot does not know the map of P in advance. As noted in [CNrDKP] (see 
also the discussion above)ra closed path through s has this property if and only if it touches 
all essential cuts in P. In [CNrDKP]rit is observed that since any exploration route can be 
traversed (off-line) without self-crossingsrthe shortest exploration route will touch the essential 
cuts in clockwise order. Moreoverrby Femma 3.9Tit will touch the cuts in this order using the 
greedy algorithm. 

Consider the case in which the point s lies on the boundary of Prbetween the endpoints of 
essential cuts e and e' . Fhe on-line algorithm given in [DKP] is essentially a greedy strategy 
which crosses each upcoming horizon as quickly as possible; it is shown in [DKP] that it will 
traverse the greedy path which touches the essential cuts in clockwise orderr beginning with 
e. Consequentlyrthis algorithm finds the optimal exploration route on-line; it is 1-competitive 
when s lies on the boundary of P. 

When s does not lie on the boundaryrthe choice of which essential cut to start with becomes 
cruciairand the robot does not have enough information to make this choice. 

Proposition 3.10 No deterministic algorithm for exploring a simple rectilinear polygon can 
be better than 5/4-competitive. 
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Add small " caves" at 
three of the four points 
A, B, C, D 



Figure 3-4: Lower bound construction 

Proof. Consider Figure 3-4. All the long edges of the polygon P have length 2rthe short 
edges have some length e much less than 2Fand s is at the center. A robot exploring P crosses 
either the upper or lower horizon first; assume the former case. At this pointTit will see two 
tiny "caves" at points A and i?rboth of which must be visited. Assume that it visits A before 
visiting B or crossing the lower horizon (other cases are similar). 

We now add an extra cave at C but not at D. Even if the robot now had the map of PT 
it would have to travel a distance of 8 to visit the caves at B and C and return to s. It has 
traveled a distance of 2 to reach A; thus its total distance is 10. On the other handrthe greedy 
exploration route which visits C first travels a distance of 8. I 

The algorithm given in [DKP] is 2-competitive when s is an arbitrary starting point in 
Prand this is the best known deterministic ratio. In the remainder of this sectionFwe give 
a simple randomized algorithm whose performance matches the deterministic lower bound of 
Proposition 3.10. 

Theorem 3.11 There is 5 / ' ^-competitive randomized exploration algorithm when s is an arbi- 
trary starting point in P. 

Proof. Consider first the following construction. If the robot standing at s were to imagine 
a thin "needle" of boundary extending from the real boundary of P to si 1 it would then be 
on the boundary of this new polygon and could explore optimally. If we restrict ourselves to 
horizontal or vertical segmentsrthen there are four possible needles that can be inserted in P. 
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See Figure 3-5. 



Inserting needles in the polygon 



b) 



The exploration route E avoiding a needle 

Figure 3-5: Polygon with needles 

Let L(-) denote the length of a path in PT E denote the optimal exploration route in PT 
and Pi denote polygon P with the i th needle insertedri = 1, 2, 3, 4. FinailyFwe denote by T; the 
(optimal) exploration route generated by the robot starting from s in P 8 - (s is on the boundary 
of each P). Of courser we are not really interested in the performance of T; in P-; we must 
show that Ti is also not far from optimal in the original polygon P. Set d, = L(Ti) — L(E). 

We claim that J2i=i d*i < L(E); the proof is as follows. Since E visits the essential cuts of 
P in clockwise orderrit meets each needle in at most one point. E can be traversed so as to 
avoid the i th needle (it takes a detour through s); let us denote this longer route by E,. Since 
Ei is an exploration route for P 8 - and T 8 - is optimal in this polygonFwe have L(Ti) < L(Ei). 

Let d[ = L(Ei) — L(E). Consider the four points at which E hits the needles (some of these 
points may be s); connect these by shortest paths to form a closed path T. E is not necessarily 
an exploration route for Prbut we have L(T) < L(E) and L(T) = J2i=i ^'- Since d, < d\ for 
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each iT 

8 = 1 8 = 1 

ThusRf the robot chooses any needlerthe exploration route T; it generates will have length 
at most 2L(E) (and simple examples show thatTstanding at srthere is no way to choose a needle 
guaranteeing a performance better than this). Howeverrthe expected value of the quantity d, 
is bounded by L(E) / ATso if the robot chooses one of the four needles uniformly at randomrthe 
expected length of the exploration route it generates is at most ^L(E). I 

3.3 Searching a Rectilinear Polygon 

Using the exploration algorithm of the preceding sectionFwe can give an O(m)-competitive 
algorithm for the general search problem described at the beginning of this chapter. A point t 
is hidden in a simple rectilinear polygon P with m essential cuts; howeverrthe point t can be 
recognized when it is first seen. A robot starting at sFand without knowledge of the map of PT 
must travel to t; its distance traveled is compared to d(s,t)Tthe length of the shortest s-t path. 
As mentioned in the introductionrO(m) is the best bound possible on the competitive ratio: if 
we take the polygon P m of Figure 3-3Fmake the m "arms" extremely longFand introduce tiny 
bends to limit visibilityrthen the [BCR] lower bound for searching m concurrent rays applies 
— no deterministic algorithm can be better than (2em — o(m))-competitive. (An S7(to) lower 
bound clearly holds for randomized algorithms as wehTwith a somewhat smaller constant.) 

We first assemble some basic lemmas that will be useful in analyzing the exploration algo- 
rithm. Because P is a simple polygonFwe have the following fact. 

Lemma 3.12 Let h and h! be horizons with the same orientation such that for some point v in 
P, every path from s to v must cross both h and h' . Then h and h' are comparable with respect 
to < (h< h' or h' < h). 

Based on this lemmaFwe can represent the partial order ■< restricted to the horizontal segments 
by a directed tree T h in which the root is the point s and the other nodes are horizontal horizons. 
For vertical horizonsrthere is the analogous representation as a tree T v . 

Lemma 3.13 Each of T h and T v has at most m leaves. 
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Proof. A leaf of T h corresponds either to a horizontal essential cutT or a horizon h that 
dominates some vertical essential cut h! . In the latter caseFno other leaf of T h can dominate 
h'Tby Lemma 3.12rso h can be uniquely charged to h! . The same analysis holds for T^Tgiving 
us the stated bound. I 

Lemma 3.14 Let v be a point in P such that s cannot see v. Then there is some horizon h 
in P that separates s from v, such that for any path from s to h, the point v can be seen from 
some point on this path. (I.e. no matter how the robot gets to h, it will have seen v.) 

Proof. By analogy with the construction for streetsFconsider the "view" of P from point v. 
Define a caveFas beforerto be a connected chain of the boundary of P such that v can see the 
endpoints but no other points of the chain. Since s cannot see vTs must lie in some cave C . 
Let u be the cavemouth of C; then there is a horizon with endpoint u such that by the time 
the robot reaches this horizonrit will have crossed the ray vu and seen v. I 

Lemma 3.15 Suppose the robot is moving perpendicularly towards an extended edge e, and no 
other extended edge of the same orientation lies between the robot and e. Then the robot can 
see both endpoints of e. 

Proof. Suppose the robot could not see the right endpoint of e; let x be the rightmost point 
on e that it can see. The line of sight from the robot to x must meet a reflex vertex of P; it is 
easily verified that this is the endpoint of a parallel extended edge e' lying between the robot 
and e. I 

Finallyr the following lemma is the key to designing the search algorithm; it is also an 
interesting fact in itself. As beforeriet d(u, v) denote the length of an L\ shortest u-v path in 
P. 

Lemma 3.16 Let P be a simple polygon (not necessarily rectilinear) , and consider a robot 
traversing some path in P. If points u and v are both visible from this path, then the robot can 
determine d(u,v) without seeing the rest of P. 

Proof. In factTit can compute a shortest path between u and v. Since we are dealing with the 
L\ metricrthe shortest u-v path in P will not generally be unique. HoweverFsome shortest u-v 
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path is polygonal (consists of a finite number of line segments). Consider the extended view of 
P that the robot has generated. By definitionrthe line segment joining the two endpoints of 
a cave in this view is completely contained in P; let us call it a "pseudo-edge." Let P' denote 
the truncated polygon whose boundary consists of the edges and pseudo-edges of the extended 
view of P. Thusrthe robot has seen all of the boundary of P' . 

We claim that there is a shortest u-v path in P that does not leave P' . The result will 
follow since the robot can obviously compute a shortest u-v path in P' . Let T be a polygonal 
shortest u-v path in PL which may enter a cave of the extended viewL crossing pseudo-edge e 
at the point x. Since v lies in P'Lthe path must re-enter P'; let us say that it next does so by 
crossing pseudo-edge e' at point y. Since P is a simple polygonLe = e' . Thus we can form a 
new u-v path which goes directly from x to y when T enters this cave; this operation does not 
increase the length. Proceeding in this wayLwe eliminate the (finitely many) places at which T 
enters a caveLproducing a u-v path T" in P' with L(T') < L(T). I 

Theorem 3.17 There is an algorithm for searching simple rectilinear polygons, which is 0{m)- 
competitive on the class of such polygons with m essential cuts. 

Proof. Lor any 8 > OLlet P d denote the rectilinear polygon obtained by "truncating" P at 
horizons that are more than 8 away from s. Lhere is some small 8 such that all points within 8 
of s are visible from s. By rescalingLwe can assume that 8 = 1. Lor fixed values of <Tthe robot 
will simulate the algorithm of [DKP] in P^Las follows. Lhe [DKP] algorithm has the property 
that at all timesLthe robot is moving along the coordinate axesL perpendicularly towards an 
extended edgeLand it maintains this direction until it reaches the closest such extended edge 
e. By Lemma 3.15Lit can see both of the endpoints of the edge e; thusLby Lemma 3.16Lit can 
tell whether e lies in P d or not. If e lies in P^Lthe robot moves towards itLas in the [DKP] 
algorithm; otherwiseLe is treated as a wall in the simulation. In this wayLthe exploration route 
generated by the simulation is the same as the route that the [DKP] algorithm generates in P s . 

InitiailyL <*) is set to 1. Whenever t is first seenLthe robot moves directly to it; if the robot 
explores P 6 without seeing iLthen 8 is doubled and the next iteration begins. 

Recall the partial orders T h and T v on the horizontal and vertical horizons of P. In P 6 T 
these naturally restrict to partial orders T[ and T^Lwhich are obtained simply by deleting all 
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nodes corresponding to horizons more than 8 away from s. Both T^ and T£ are still directed 
trees rooted at sLand by Lemma 3.13reach has at most m leaves. Since each essential cut of 
P 6 is a leaf of one of these treesLP* 5 has at most 2m essential cuts. Lemma 3.14 implies that 
one way to explore P 6 would be to travel to each of its essential cuts individually; this would 
require a total distance of at most 2m(28) = 4m8. Since the robot is following a 2-competitive 
algorithmLit travels at most 8m8 in exploring P s . 

Given thisLwe can complete the proof as follows. If the robot sees t when 8 = lLthen it 
can travel to it optimally. OtherwiseLassume it first sees t in P^Lwith 8 = 2 k . Since it did not 
see t when exploring P^ 2 ^Lwe have by Lemma 3.14 that d(s,t) > 2 k ~ l . 

NowLwe must bound how far the robot has traveled before reaching t. Until it sees tTit has 
traveled at most 

k 

J2^m2 j < 8m(2 k+1 ) < 32md(s,t). 

3=0 

Since 8 = 2 k < 2d(s,t)T it is easily verified that traveling to t can require no more than an 
additional 3d(s,t). Thus the robot's path is 0{m) times d(s,t)Tas required. I 
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Chapter 4 



The Robot Localization Problem 



A fundamental task for an autonomous mobile robot is that of localization — determining its 
location in a known environment [CoxFTPBHSSTWang]. This is also a problem well-known in 
everyday lifer where it can be surprisingly easy to become lost even with a compass and some 
knowledge of the terrain — and where it is sometimes crucial whether or not the map one is 
using bears a little mark with the wordsr"You are here." 

Thus we are dealing with a robot at an unknown location in an environment for which it 
does have a map. For our purposesrthe environment E that we consider will generally be the 
interior of a large polygon filled with a finite number of polygonal obstacles. We also assume E 
is connectedrso that there is an obstacle-avoiding path between any two points of E. A robot — 
assumed throughout to possess visionFa map of £Fand knowledge of its orientation — "wakes 
up" at some point in E; its goal is to determine where it is. Since we are assuming the robot 
knows its orient ationFwe will sayTusing somewhat non-standard terminologyrthat two regions 
are congruent if one can be obtained from the other by a translation (as opposed to a rigid 
motion). 

Fhe set of all points the robot can see from its current position forms a star-shaped visibility 
polygon P. If there is only one point in E from which the visibility polygon is congruent to 
Prthen the robot can immediately determine uniquely where it is. Otherwise — if it is in 
a highly self-repeating environment such as a typical large building — there are a number of 
different places the robot could be. GuibasFMotwaniFand Raghavan [GMR] considered the 
question of enumerating all possible locations for the robotTgiven a visibility polygon PFwhen 
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the environment E is itself a simple polygon. They gave a "single-shot" algorithm which runs in 
time 0(|P||£|)ras well as more complicated data structures for answering "localization queries." 

In a strong senserthoughr enumerating the possible locations is only a first step towards 
solving the localization problem. For if the environment is connected and robot begins moving 
around in £rbuilding up larger and larger partial maps P' D P of its "neighborhoodF there 
will come a time when there is only one point in all of E at which one can place a partial map 
congruent to P'; the robot has then determined exactly where it is. (We will say that it has 
localized.) 

One way to see that the robot will eventually localize is to note the straightforward fact that 
if it explores all of £rthen it can determine where it started from. But this is a crude solution; 
one would like to design an algorithm which causes the robot to travel as little as possible. 
From this point of viewFa good localization algorithm is constantly adapting its search based 
on the set of possible locations of the robotTand their position in the map. We can therefore 
phrase the following question: how should the robot move in E so as to determine its location 
efficiently? 



Figure 4-1: The infinite fence problem 

Although the question is quite precise — we can choose to measure efficiency in terms of 
total distance traveled — it is not immediately clear that it can have an interesting answer. 
Indeedrif we consider the polygon of Figure 4-lTa robot starting from the middle has no choice 
but to walk to one of the ends before it can determine where it is; and this is only a factor of 
two better than the trivial approach of exploring the whole environment. So in what sense can 
we say that one algorithm is significantly more efficient than another? 

We argue here that this is an ideal setting in which to use the competitive ratio. Specificallyr 
in the polygon of Figure 4-lTthe key point to observe is that the robot could not have done 
any better. Even if we had an omniscient robot that knew where it wasrit would have to travel 
half the diameter of the polygon before being able to prove that it started from the middle. In 
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this senseriocalization becomes an on-line problem: we define a localizing path to be any path 
followed by a robotTat the end of which it can determine uniquely where it isFand compare 
the distance an on-line robot travels before localizing to the length of the shortest possible 
localizing path. The ratio of these two distances is our performance measure for a localization 
algorithm; the worst-case value of this ratioFover all starting points in £ris its competitive 
ratio. 

We find the search for good localization algorithms interesting for two fundamental reasons: 

1. Focalization is a problem that arises in numerous practical robotics situations. 

2. The use of the map in helping to guide the robot's search introduces some complica- 
tions that are not generally found in on-line navigation problems. In this caserit means 
that straightforward on-line techniques such as spiral search are no longer asymptotically 
optimal. 

Applications in Robotics 

The localization problem has been considered in a wide variety of contexts in the robotics 
literature. One application is in the design of robot vehicles that must perform a certain task 
repeatedly in the same environment. HereFlocalization is used to determine the starting location 
at the beginning of the taskTand to maintain positioning information over time [CoxrDrrWang]. 
A similar use of localization is in analyzing aerial photographs to determine the location from 
which they were taken [YD]. 

Another major situation in which localization is used is in the design of autonomous ex- 
ploration vehiclesFsuch as the current prototypes for Mars rovers [MAWMTSN]. For exampler 
[MAWM] discusses strategies by which a mobile robot can determine its location after a short 
period of "reconnaissance": in the model considered thereFa rough global map of the Mar- 
tian terrain is known; the exploring robot relays local information back to Earthr where an 
obstacle-avoiding path is then planned. 

In a related veinr[TPBHSS] draws a distinction between localization algorithms for "update" 
problemsrin which there is some rough estimate of the robot's starting locationFand "drop-off" 
problemsrin which there is no initial information. In this terminologyFwe will be considering 
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the latter type of problem. Againr [TPBHSS] describes the role of "reconnaissance" in the 
localization problemr without providing any concrete algorithms. 

Theoretical Background 

The only previous theoretical treatment of the localization problem is the paper of Guibasr 
Motwanirand Raghavan [GMR]Tand as mentioned aboveHt deals only with the "static" version 
of the question. There has been no previous work on this problem from the perspective of 
competitive analysisr though there is a straightforward (and non-optimal) algorithm which is 
not difficult to describe; we will discuss this shortly. 

In what followsrwe deal with fairly simple types of environments in order to emphasize the 
combinatorial aspects of the problem rather than the geometric ones. ThusFwe consider the 
following two types of environments. 

• Bounded-degree trees embedded in R d . That isrthe vertices are realized by points in R d 
and the edges by line segmentsFand the robot is constrained to move on the edges and 
vertices. If T is an embedded treelVe will use n to denote the number of leaves it has. 

• Rectangle packings in the planer comprised of n rectangles [PYrBRaScrBBFYrBRiSi]. 
Recall from Chapter 2 that we assume all rectangles have at least unit thicknessFand that 
there is always just enough room for the robot to move between neighboring rectangles. 

Arguably the most natural way to design a localization algorithm for either of these environ- 
ments is the spiral search technique of Baeza-YatesFCulbersonFand Rawlins [BCR]. For our 
purposes herer spiral search could be implemented by having the robot iteratively explore all 
points of the environment within distance 1, 2,4, ..., 2 J , .. T until it knows where it is; the 
resulting algorithm will be O(ra)-competitive. 

In navigation problems for which no information about the map is known (such as those of 
the previous chapters)rit is frequently not difficult to show that spiral search is optimal up to 
constant factors. Indeed a great deal of work is often done to determine what these constant 
factors are [BCRrKRTrKMSY]. But the fact remains that no algorithm can be better than 
S7(n)-competitive for many of these problems. In some senserthis is not surprising; when the 
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robot knows nothing at ahTit is difficult to do much besides a brute-force search out to larger 
and larger radii. 

What we find interesting about the localization problem is that knowledge of the map enables 
the robot to begin focusing its search as it sees more and more of its surroundings; in particularr 
this will lead to a localization algorithm that is o(ra)-competitiveP improving asymptotically 
on spiral search. At a more general leveirwe are interested in on-line navigation problems in 
whichras is common in real applicationsr the robot has some limited information about its 
environment. Such problems tend to contain interesting structure that can be exploited when 
designing algorithmsFand often provide insight into the value of a map in performing navigation 
tasks. Another example of this is the &-trip shortest-path problem considered by Blum and 
Chalasani [BC]. Here the robot wishes to make k trips between points s and t while minimizing 
the average time per trip; thus it can make use of partial maps of the scene on later trips. For 
a further perspective on the value of different types of information in performing navigation 
tasksrsee the work of Donald on "information invariants" [Don]. 

The localization algorithm we present is quite natural and simple to state; the difficulty lies 
in analyzing the competitive ratio. The algorithm performs an initial period of spiral search on 
a local area which is sufficiently restricted to keep the competitive ratio from getting too large. 
At some point during this searchrthe robot is able to identify one or more "critical directions" 
in the environment; by searching only in these directionsrthe number of possible locations can 
be eliminated much more quickly. The algorithm then performs a final "clean-up" stagerin 
which the remaining possibilities are eliminated in an iterative fashion. In the case of treesrthis 
algorithm is 0(ra 2/,3 )-competitive. The analysis for rectangular obstacles appears to be quite a 



bit more complicated; howeverFwe are able to show a competitive ratio of (raw °f °^ n ) = o(n). 
In contrastTthe strongest lower bound we can show is i7(y / ra)rin both types of environments. 
Closing this gap remains an interesting open question. 

In independent workrDudekrRomanikrand Whitesides [DRW] give a geometric implemen- 
tation of the "shortest-distinguishing-paths" algorithm — which appears as Step 3 in the main 
algorithm of this chapter — when the environment is a simple polygon. Their algorithm is 
0(A;)-competitive when the robot initially cannot distinguish among k possible starting points 



Recall that f(n) is o(g(n)) when lim n ^oo f(n)/g(n) = 0. 
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(this number may be as large as S7(n)rwhere n is the number of vertices of the polygon). 

4.1 Lower Bounds and Other Examples 

We give two examples of trees embedded in the planerboth of which show that no localization 
algorithm can be better than i7(y / ra)-competitive on trees. In this and the following sectionr 
we assume that a robot on a tree cannot make use of vision — all it knows is the orientation 
of all edges incident to the vertex it currently occupies. Note that this can be simulated in a 
simple rectilinear polygon by introducing tiny bends everywhere to limit visibility; thusFany 
lower bound for trees immediately carries over to the class of simple polygons. 



Figure 4-2: Fhe first lower bound 

By a branch-vertexT we mean a vertex with degree at least 3. Fhe tree T in Figure 4-2 
consists of p blocksFeach of which has p branch- vertices and long "passages" to its neighboring 
blocks. Fhus the number of leaves in T is n = 2(2p — 1) + (p — 2)(2p — 2) = 2p 2 — 2p + 2. 
Fhe robot cannot localize until it has found one of the long passages; by placing it in the block 
whose passage corresponds to one of the branches it will search lastTwe can force the robot to 
travel 0(p) = Cl(y/n) times too far. (In factTthere is an algorithm which is O(y / ra)-competitive 
on this tree; this is not difficult to see.) 

We now turn to a second £l(^/n) lower bound for treesFwhich is somewhat more subtle to 
analyze and also has the advantage of extending to rectangular obstacles. Fhe tree T h is shown 
in Figure 4-3; it consists of a path II of length raFwith a path growing north from each vertex. 
All paths but the middle one have length h; the middle one has length h + 1 (the value of h 
will be fixed later). Observe that T h indeed has n leaves; and the key observation is that the 
robot cannot localize until it has reached one end of IITor traversed the middle path. 
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Figure 4-3: The second lower bound 



First we argue that if h = [v^J T then no algorithm on T h can be better than £l(^/n)- 
competitive. Let Vi, . . ., v h be the vertices of II that lie between h and 2h steps to the west of 
the entrance to the middle path. We place the robot at one of these verticesFsuch that it will 
traverse all the h other possible north-leading paths before finding the middle one. Thusrit 
will travel at least h 2 > n — 2^/n before finding the end of the middle path; to reach either end 
of Iirit must travel at least n/2 — 2^/n. Either way this is a distance of 0(n). Meanwhilerthe 
off-line adversary need walk a distance of at most "i^/n to reach the end of the middle path. 

ConverselyFlioweverrone cannot improve this lower bound by varying the value of h. Assume 
the robot starts somewhere on the path lira distance d from the entrance to the middle path, 
(if it starts elsewhererthe arguments are essentially the same). If h < y^rthen the robot can 
apply the two-way spiral search algorithmrtraveling all the way up each new path it encounters. 
Fhis algorithm is 0(/i)-competitive. And if h > y^rthen the adversary must travel at least a 
distance of y/n to localizeFso the robot can ignore all the north-leading paths and simply apply 
the two-way spiral search algorithm to the path II until the nearer end is reachedrtraveling a 
distance 0(n). 

Fhis latter case suggests some of the difficulty inherent in obtaining a o(ra)-competitive 
algorithm — when h = rarthe full spiral search algorithm is S7(n)-competitive; it is necessary 
to identify the path II as somehow being more important than the profusion of paths leading 
north. We will see a general technique for doing this in the next section. 

In closing this sectionFwe simply note that it is easy to construct the second lower bound 
out of 0(n) rectangles; see Figure 4-4. We use very long rectangles growing north and south 
so that it will take the robot too long to try completely leaving the middle path II. Each of 
the paths growing north out of II is now simulated by two of these very long rectangles with a 
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Figure 4-4: Making the second lower bound work for rectangles 

shorter one in between; also II now bends slightly so that the robot cannot see all the way down 
it. Thus no algorithm can have a performance guarantee better than £l(^/n) for environments 
of n rectangular obstacles. 

4.2 The Algorithm for Trees 

Recall that by a geometric tree T we mean a pair (V, i?)rwhere V is a finite point set in R d 
and E is a set of line segments whose endpoints all lie in V. The segments of E intersect only 
at points in Frand they do not induce any cycles. 

As beforer vertices with degree greater than 2 in T will be called branch-vertices. It will 
turn out that the degree-2 vertices of T are largely unimportant in the algorithm; thus we 
change our definition of T to an equivalent one without such vertices. Specificallyrwe will say 
that all the vertices of a geometric tree T are either leaves or branch- verticesr and edges are 
now polygonal paths between the vertices. Moreover IVe assume for the sake of simplicity that 
T has bounded degree; namelyrfor some absolute constant AI"at most A segments in E are 
incident to any given vertex in V . So we again change notation slightly by saying that T has n 
branch-verticesrand consequently has at most (A — 2)ra + 2 < An leaves (and at least n + 2). 

If U C VTwe use T(U) to denote the subtree induced by U . When U = {x, yjTthis is simply 
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the path from x to y; we denote its length (as a polygonal path under the Euclidean distance) 
by d T (x.y). Finallyr recall that the robot is constrained to move on the vertices and edges of 
TTand can make no use of vision other than to know the orientation of all edges incident to its 
current location. 

Consider a geometric tree T with n branch- vertices. We wish to prove the following. 

Theorem 4.1 There is an algorithm which is 0(n 2 ^ 3 )-competitive for the localization problem 
on geometric trees. 

To prevent various pathologiesFwe assume that all the points in V have rational coordinatesr 
and that the minimum length of any edge in E is 1. (These assumptions can be avoided at the 
cost of more cumbersome definitions below.) 
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Figure 4-5: Fining up T" with T 

We can assume without loss of generality that the robot begins at one of the vertices of 
T (i.e. at a leaf or a branch- vertex)T rather than in the middle of one of the edges. Fhis 
is simply because the robot can initially perform two-way spiral search to reach the closest 
vertexrtraveling no more than 9 times too far. MoreoverFwe can then assume that it in fact 
begins at a branch- vertexFsince its only choice at a leaf is to move along the incident edge. 

At all timesrthe robot maintains a search regionTsome geometric tree T" which consists of 
everything it has seen so far. Fhis means that it is in a part of T which locally looks like T"; 
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thus the basic computation the robot will be performing as it explores is that of "lining up" its 
current copy of T" with various parts of TTin the natural way. The robot maintains its current 
location a on the map of TTand a set D a of possible locations v that this might correspond to 
in T. (To keep the notation clearFwe will use Greek letters to denote vertices in T 1 .) If v G D a T 
then we will use Y ayV (T') to denote the subtree of T induced by rigidly placing T" on T so that 
a is mapped to v. See Figure 4-5. 

Of courserthe robot always does have a "genuine" location in T; it simply does not know 
what this is (until it has localized). Notationailynt is sometimes useful to refer to this unknown 
location: when the robot is at a £ TTwe will denote its true position in T by Z(a). (Note 
that Z(a) £ D a .) 

The following fact is immediate but will be used frequently. 

Lemma 4.2 For all a, (3 in T' , Dp is equal to the set D a translated by the vector from a to (3. 
In particular, \D a \ = \Dp\. 

As the robot performs its explorationRt remembers the branch- vertex in T" at which it first 
woke up; we will denote this vertex j . In the course of the localization algorithmrthe robot 
maintains four principal quantities of the search region. 

1. As we will see belowrT' is initially constructed using spiral search. Thus we maintain 
the current "search radius" r(T'); this is the distance out to which depth-first search is 
currently being performedrfrom the initial location 70 in T". 

2. The common value of \D a \ (as in Femma 4.2) will be denoted p(T'); this is simply the 
number of possible placements of T" in T. 

3. Fhe number of branch- vertices of T" will be denoted b(T'). 

4. Fhe quantity w(T') is defined to be 

max \D a f]Y av (T')\. 

Fhat isTif we pick a to be the "origin" of TTthen we can place it on v so that it covers 
w(T') other origins. 
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The algorithm consists of three main stepsr which are controlled by the following global 
structure: 

Initially b(T') = 1, p(T') = 0(n), w(T') = 1 
While b(T') < p(T') and b(T') < n/w(T r ) 

Execute Step 1 
If p(T') < b(T') then 

Execute Step 3 
Else execute Step 2 followed by Step 3 

In the remainder of this sectionrwe will describe each of the steps individual! yT then analyze 
the competitive ratio of the resulting algorithm. 

Step 1: Restricted Spiral Search 

The robot first wakes up at some initial location 70 G T" and begins performing spiral search 
[BCR]. This can be described as follows: the robot starts at 70 and performs successive depth- 
first searches so as to see all points within distance 2 J of Z(7 )rfor j = 0,1,2,.. .. 
The fundamental fact about spiral search is the following. 

Lemma 4.3 At the end of Step 1, the robot has traveled no more than 8Ab(T') times the length 
of the optimal solution. 

Proof. Suppose the final search radius was r = 2 J . So the total distance traveled by the robot 

is at most 

i 

^2A6(T') • 2 8 ' < 2 j+2 Ab(T'). 

8 = 

Since Step 1 did not terminate when the search radius was equal to 2 J_1 Tany localizing path 
must travel a distance of at least 2 J_1 away from 70. The bound follows. I 

Step 2: Extending the Critical Path 

If P is a directed polygonal path and k > lTwe use P k to denote the path formed by joining 
together k copies of P in succession. We use P _1 to denote P with the edges presented in 
the reverse order. Let U = {w l7 . . .,u k } be a subset of the vertices of T. We will say that U 
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(a) 



(b) 



Figure 4-6: (a) A periodic path, (b) A comb tree. 

induces a periodic path if T(U) is a simple path and there is some polygonal path P such that 
T(ui, u i+ i) = P m ' for natural numbers m l7 . . .,m k _i. We will say that U induces a comb tree 
if T(U) consists of a periodic path on vertices {f l7 . . . , v k } distinct from U (the 6ase)rtogether 
with disjoint paths T(wj,fj) (the teeth)Tall of which are mutually congruent; the vertex f, will 
be called the support point of m,. (See Figure 4-6.) 

Recall that at the beginning of Step 2Fwe have an "origin" a of T" and a placement Y UiV of 
T" which covers at least w(T') other origins in T. We will use W to denote this set 

D a nY atV (T) 

of covered origins. 

Lemma 4.4 VF induces either a periodic path or a comb tree in T (and hence also in T' ). 

Proof. The proof is based on the following two claimsFwhose proofs are given in the Appendix 
of this chapter. 

Claim 4.5 Let T be a geometric tree, r,x,y,z G V. Suppose that T(r,x) is congruent to 
T(y,z), and T(r,y) is congruent to T(x,z). Then T(r,x,y,z) is either a periodic path or a 
comb tree. 
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Claim 4.6 Let T be a geometric tree and U a subset of the vertices. Suppose that there is some 
x G U such that for all other y,z G U , T(x, y, z) is either a periodic path or a comb tree. Then 
T(U) is a periodic path or a comb tree. 

Using theserwe prove the lemma as follows. Recalling that W = D a n Y ayV (T'), let us suppose 
that W = {wi, . . . , w fc }Lwith Wi = v. Consider any other tc,-, Wj with 1 < i < j < k. Since these 
belong to _D„Lthere is a path congruent to T(wi, w 8 ) emanating from WjLand a path congruent 
to T(wi,Wj) emanating from w,. Note that these two paths have a common endpoint (at the 
point Wi + (wi — Wi)-\- (wj — Wi)). Let u denote this common endpoint; then Wi, w,, Wj, u satisfy 
the hypotheses of Claim 4.5. This in turn shows that W and the distinguished point Wi satisfy 
the hypotheses of Claim 4.6rand hence T(W) is either a periodic path or a comb tree. I 

But suppose that T(W) is in fact a comb tree. Then by Lemma 4.2Lthere is a vertex a' G T" 
such that D a i contains the set W of support points of T(W) (which form a periodic path). In 
factLif we let v' denote the support point of v Lwe see that W C D a * P\ Y a i yV iTso 

\D a ,nY a ,y\ > \W\ > w(T'). 

ThusLby using a' instead of oTwe obtain a set W (satisfying \W'\ > w(T')) which induces a 
periodic path. We state this as the following extension of the previous lemma. 

Lemma 4.7 W can be chosen so that it induces a periodic path in T (and in T'). 

Suppose T(W) = T'(W) = P m is such a periodic pathLwith origin a. The robot moves 
to a; it is now standing somewhere in the middle of a long periodic pathLand can thus follow 
successive copies of P by moving in one direction (the forward direction)Land successive copies 
of P _1 by moving in the other (the backward direction). The robot moves in each of these 
directions until it determines the largest i and j for which it is possible to traverse P l in the 
forward direction and P _J in the backward directionLstarting from a. 

Locating a and tTand traversing P l and P _J Lconstitutes Step 2. Note the following facts. 

Lemma 4.8 In performing Step 2, the robot travels no more than 4 + w( ^" ) _ 1 times the length 
of the optimal solution. 
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Proof. Moving to a costs at most 2r(T r ). T 1 contains P m as a pathrand m > w(T')Tso the 
length of P (and hence P _1 ) is at most iLj . AlsoFeach copy of P uses up an additional 
branch- vertex of TFso the robot will traverse at most n copies of P and P _1 . Since each copy 
is traversed twice (the robot returns to a)rit travels at most 2r(T') + J?^,)-! • ^ n ^ ne other 
handrthe length of the optimal solution is at least r(T')/2T&s argued above. I 

Lemma 4.9 At the end of Step 2, p(T') < 2n/w'(T'). 

Proof. For v G D a and u any other vertex of TTwe will say that u is P-covered (resp. P _1 - 
covered) by v if by starting at v and following successive copies of P (resp. P _1 )rthe robot can 
reach u. By considering the path P m in Y UiV Twe see that the total number of vertices that are 
P-covered or P _1 -covered by each v G D a is at least w(T') (counting v itself). Reversing the 
names of P and P _1 if necessaryTwe can assume that the average number of vertices P-covered 
by a vertex v G D a is at least w(T')/2. 

On the other handrwe claim that no vertex of D a P-covers any other. This is simply 
because it is well-defined for each v G D a how far one can move along T following the edge 
sequence of P n ; thus if v P-covers fTthe robot can eliminate at least one of v or v' from its 
set of possible locations. 

This in turn implies that no vertex u G T is P-covered by more than one member of D a 
(if it were covered by tworthe one farther from u would cover the one closer to u). Since the 
average number of vertices covered by a vertex v G D a is at least w(T')/2rand each vertex is 
covered at most oncer 

\D a \=p(T')< 2n 



w(T' 



Step 3: Cleaning Up 

Once p(T') has become sufficiently smahTthe robot can finish its task by brute force. Specifi- 
callyrassume that it is currently located at an origin a in T". For each pair of vertices v, v' G D a T 
define their shortest distinguishing path to be the shortest path Q such that it is possible to 
traverse Q starting from v but not from v' (or vice versa). (This is the least one has to travel 
to tell v from v'.) 
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In Step 3rthe robot iteratively applies the following strategy. Over all v,v' G -D„rit chooses 
the pair with the distinguishing path of minimum length. By following this pathrthe robot will 
be able to eliminate either v or v' from D a Tand perhaps both. Meanwhilerthe optimal off-line 
algorithm must travel at least this farFsince there is no way to eliminate even a single vertex 
from D a otherwise. The robot then returns to a and begins the next iteration. As there are at 
most p(T') — 1 such iterationsFwe have proved 

Lemma 4.10 In Step 3, the robot travels no more than 2p(T') — 2 times the length of the 
optimal solution. 

The Global Structure 

Finallyrwe give an absolute bound on the competitive ratioFusing the lemmas above. Note first 
of all that the tree T h of Figure 4-3Fwith h = raFshows that the algorithm which simply applies 
Step 1 until p(T') < b(T') and then switches to Step 3 is no better than O(ra)-competitive. But 
introducing the option of Step 2 prevents the initial period of spiral search from going on for 
too long. The crucial fact is the following. 

Lemma 4.11 By the time b(T') exceeds n 2 ^ 3 , the robot will have stopped executing Step 1. 

Proof. For a G T" and v G _D„riet w av (T r ) denote the cardinality of D a n Y ayV (T') — that isr 
the number of origins in D a covered if T" is placed so that a corresponds to v G T. To prove 
the lemmarit is sufficient to show that if both b(T') and p(T') are greater than ra 2/,3 rthen some 
w av (T') is at least n 1 ^ 3 . 



For a given v G Triet 



and e„ = \E„\. Thusr 



E v = {a : v G D a } 



J2e v =b(V)p(V). 

v£V 



Now let us compute the sum S of w av (T r ) over all pairs (ot,v) such that v G D a . Fix such 
a pair (a,v). FirstTit contributes once to w av (T'). Alsorfor every other (3 G -E^T there is 
some u G T such that v corresponds to (3 in Y ayU (T'); thusrthe pair (ot,v) contributes once to 
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w au (T r ). Thus (a, v) contributes a total of e v to the sum S . Summing over all pairsFwe obtain 

a,v v 

Since T has n branch- verticesrthis value is minimized by setting each e v equal to 

±J2e v = -b(V)p(V) 



and thus 



5>«,„ = E e l> n - - 2 Hr?p(T>y = h(Tfp(T) 



By the pigeonhole principler applied to the b(T') choices of a and the p(T') choices for v G aT 
there is some pair v, a for which 

w a , v > h(T')p(T') > n 1 / 3 . 

as desired. I 

First suppose the robot goes directly from Step 1 to Step 3. When this transition happensr 
b(T') < n 2 l 3 and p(T') < n 2 / 3 Tso by Lemmas 4.3 and 4.10rit travels at most 0(n 2 l 3 ) times the 
length of the optimal solution. 

Otherwiserthe robot goes from Step 1 to Step 2rthe transition occurrring when b(T') and 
n/vj(T') are at most ra 2 / 3 . Thus by Lemmas 4.3 and 4.8Lit travels at most 0(n 2 l 3 ) times too 
far in Steps 1 and 2. NowLLemma 4.9 implies that it will begin Step 3 with p(T') < 2ra 2/,3 Lso 
by Lemma 4.10Lit travels only 0(n 2 ^ 3 ) times too far in Step 3 as well. As these are the only 
two casesLthis completes the proof of Theorem 4.1. 

4.3 The Algorithm for Rectangles 

AgainL to keep complications related to visibility to a minimumL we work with a rectangle 
packing [BRaScLBBFYLBRiSi]Las defined earlier. By a vertex of the environmentTwe will 
mean a corner of some rectangle. So in the spirit of the previous sectionFone could picture 
a planar graph embedded in the two-dimensional integer gridrall of whose bounded faces are 
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rectangles. 

The algorithm for the case of rectangular obstacles is very similar to the one for trees; the 
main difference is the lack of an analogue to Lemma 4.4 to provide the robot with an obvious 
critical path to explore. As a resultLthe transition from Step 1 to Step 2 cannot happen as 
early as in the algorithm of Section 4.2; ratherrthe robot waits until the spiral-search branching 
factor becomes too larger and then begins exploring several critical paths in succession before 
beginning Step 3. We elaborate below. 



Let X(n) = ,/ , lo , Rn . The set of n rectangles will be denoted RT and the current search 

v ' y log log n ° 

region R' . b(R') will now simply be a measure of the number of rectangles in R 1 (including 
partial rectangles). Other notation is as before. The global structure of the algorithm is as 
follows. 

Initially b(R') = 1, p(R') = 0(ra). 
While b(R') < ^ 

execute Step 1 
While p(R') > ^ 

execute Step 2 
Execute Step 3 

Step 1: Restricted Spiral Search 

ActuallyLthis is not nearly as straightforward as it was for trees. We would like the robot to 
iteratively explore all parts of the environment within distance 1,2,4,... until b(R') gets too 
large. The problem is that the robot must be careful to keep track of the distance to each 
point it encounters; that isLit must know a shortest path back to the origin 70 for each point it 
reaches. FortunatelyLthere is a "compact search" subroutine due to BetkeLRivestLand Singh 
[BRiSi] which accomplishes just this. 

Thus Step 1 will proceed as follows. For successive values of 2 J (j = 0, 1,2, . . .) the robot 
explores all points within 2 J of Z(j ). We implement stage j of this process using a simple 
modification of the compact search algorithm of [BRiSi] — the robot turns back whenever it is 
about to move more than 2 J from 70. 
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Lemma 4.12 At the end of Step 1, the robot has traveled 0{b(R')) times the length of the 
optimal solution. 

Proof. The main result of [BRiSi] is that the robot will travel at most 10 times the total 
length of all edges in the region searched. In stage jTeach rectangle (or partial rectangle) has 
perimeter at most 4 • 2 J and there are at most b(R') rectangles; thus the robot travels at most 
AOb(R') -2 J . Summing over all stagesrthe distance traveled is at most 80r(R')b(R r ); meanwhiler 
the optimal solution has length at least r(R')/2. I 

Step 2: Extending Multiple Critical Paths 

Let c(R') denote the quantity 

min min d R (v,v'). 

(a£R') (v,v'£D a ) 

That isTc(R') is the smallest distance between two vertices in R corresponding to the same 
vertex in R 1 . Let P denote the polygonal path from v to v' . As in Section 4.2Lthe robotL 
starting from v Ltries to follow as many copies of P in succession as possible. After thisLc(i?') 
and the values of v, v' are updatedLand the robot iterates. Step 2 comes to an end when p(R') 
gets down to tt^t. 

To bound the distance traveled in Step 2Lwe first prove a combinatorial lemma about trees 
with edge lengths. Let r be a tree with m vertices and maximum degree ALand let p be a 
length function on its edges. If v is a vertex of TTB d (v) will denote the set of all vertices within 
distance d of v and rad(r) will denoteLas usualLthe smallest d such that there exists a v with 
r C B d (v). 

Lemma 4.13 There exists a vertex v* of t for which 

log m 



B rad(r) ( V* 
Mm)2 



> 



A 



Proof. Let u be a vertex of r that realizes the radius; that isL B rad ^(u) D r. For i 
1, . . . , A(m) 2 Llet r, denote the set of all vertices in r whose distance from u is at most x( \ 2 
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rad(r )Tand to, = |r 8 |. Setting m = lTwe know that 

A(m)~ 



n 



rrii 

= TO 



8 = 1 ' X 

so by the pigeonhole principle there is some j for which ™ J+1 > log to. 

Consider cutting off the tree at distance x( 3 m]2 -rad{r) from MTand let e l7 . . . , e s be the edges 
that cross this boundary. Since T has maximum degree Arand there are only rrij vertices in 
TjTwe have s < Arrij; nowr since rrij + i > rrijlogmTone of the subtrees below some e k has at 
least log to/ A vertices in Tj + i — Tj. Thus we can let the vertex in the subtree below e k which is 
closest to u be v* in the statement of the lemma. I 

Lemma 4.14 In Step 2, the robot travels at most O(jt^) times the length of the optimal 
solution. 

Proof. Consider building a shortest-paths treer rooted at 7oTon the vertices of the search 
region R' . (Note that we can build such a tree since we used a compact search algorithm in 
Step 1.) This tree has maximum degree four (since the obstacles are rect angles )Tsor applying 
Lemma 4.13Twe see that there is some vertex a* with at least |(log n — log A(ra)) vertices within 
a radius r S ■! • Now suppose that there are not two vertices v,v' G D a » for which 

2r( N') 
<«-')< ^. (4.D 

Then we could pack into R a collection of disjoint balls of radius r S n -l Teach of which contains 
at most one member of D a * and at least |(log n — log A(ra)) vertices of R. But this would imply 

, „ , / ,x 4ra An 

D a , = p(R') < — - < — - 

log n — log X(n) X(n) 

and thus the robot would not execute Step 2 at all. 

Thus we have shown that as long as b(R') > jj^- and p(R') > ^rTthere will be vertices 
v,v' G D a * satisfying Equation (4.1). As beforeriet P denote the polygonal path from v to v'T 
and suppose that the robot traverses P s beginning at Z(a*) but is not able to traverse P s+1 . 
We will call this a short iteration if s < A(ra)Tand a long iteration otherwise. 
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Observe that a short iteration eliminates at least one of v , v ' from _D„.Lso there are at most 
n short iterations. Alsorthe length of P is at most ,/ ,, Lso the robot travels at most r .\ J in 
one such iterationrfor a total of -4^ • r(R'). 

Now we claim that there can be at most one long iteration in Step 2. Indeedrat the end 
of such an iteration each v G D a * P-covers at least \{n) verticesLso by arguments strictly 
analogous to those in the proof of Lemma 4.9rthere can be at most — ^- vertices in D a *. Thusr 
Step 2 will come to an end after the first long iteration. MoreoeverLs is always at most nLso 
the robot will travel at most ?™. 2 ■ r(R') in this iteration. 

As the length of the optimal solution is at least r(i?')/2Lthe bound follows. I 

Step 3 is implemented just as before. Fhusrby Lemmas 4.12F4.14Fand the analogue of 
Lemma 4.10 for rectanglesFwe have 

Theorem 4.15 The above algorithm is O(jj^t)- competitive for the localization problem in an 
environment of n rectangles. 



4.4 Placing Unique Landmarks 

Until nowFwe have been considering the "drop-off" version of the problem [FPBHSS]rin which 
the robot is placed in an environment with very little starting information. But another sit- 
uation in which localization arises is that of a robot which must repeatedly perform tasks in 
the same environmentLand must begin by determining its current location. In such situationsL 
it is useful to place k unique landmarks in the environment L so that the robot immediately 
knows where it is upon encountering one of them. It is not difficult to make these notions 
precise in our model. Let us simply say that a k-marking of the environment E is a function fj, 
from the vertices of the environment to the set {0, 1, . . . , k}; exactly one vertex gets each value 
j = 1, . . . , fcLand the rest get the value 0. Each time the robot gets to a new vertex v Lit can 
determine the value fJ,(v). 

For the sake of concretenessLlet us consider the geometric trees of Section 4.2. Fhe goal here 
isLfor fixed A;Lto give a A;-marking fj, and an accompanying localization algorithm which achieves 
the lowest possible competitive ratio. Frying out this notion on the examples of Section 4.1L 
one finds that different environments can have strikingly different behaviors with respect to this 
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measure. For exampleFone must place at least y/n — o(y/n) landmarks in the tree of Figure 4- 
2 before bringing about an asymptotic improvement in the best competitive ratio attainable 
(since there will have to be a landmark in ah but a o(l) fraction of the \fn "blocks"). On the 
other handrby placing a single landmark at the exact center of the long path II in Figure 4-3r 
one brings the best attainable competitive down from y/n to 9 (the robot simply uses two-way 
spiral search until it hits this landmark or one end of II). 

In light of thisFwe believe that the algorithmic question of finding the optimal A;-markingr 
given a tree T or a set of rectangles i?ris very interesting; we leave it as an open problem. In the 
remainder of this sectionFwe turn to statements that can be made in general for environments 
of trees and rectangles. 

Proposition 4.16 For each tree T, there is a k-marking and a localization algorithm which is 
0(j)- competitive. 

Proof. Fhis is not difficult to prove directly; and using a lemma from [FoSjTwe can actually 
prove the stronger statement that there isrin effectTa single marking which works for all k. 
Specihcallyr it is proved in [FoS] that there is a numbering ip of the n vertices of T (i.e. a 
bijection from V to {l,...,n}) so that for each kTthe removal of the vertices numbered 1 
through k results in a forest in which no component has more than ^ vertices. 
Given ipTwe define the A;-marking fj, k in the natural way: 



Vk[v) 



ip(v) if 1 < ip(v) < k 
otherwise 



Given this markingrthe localization algorithm is rather unsubtle: the robot performs spiral 
search until p(T') decreases to 1 or it reaches a landmark (at which point p(T') immediately 
equals 1). Since it is traveling in a component with at most ^ verticesr&(T') will never exceed 
^-Tand the result follows. I 

UnfortunatelyFwe see that no stronger statement can be made at this level of generality 
for treesr since the trade-off in Proposition 4.16 is tight up to constant factors for the tree of 
Figure 4-2 when k = \fn. For the case of rectangular obstaclesFwe can prove a similar trade-off 
non-constructively. 
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Proposition 4.17 For each environment R of n rectangles, there is a k-marking and a local- 
ization algorithm which is 0( nlc ^ n ) - competitive. 

Proof. Note that the statement is trivially true if k is not at least fi(log ra )Tso we will assume 
that it is in what follows. AlsoFwe only consider markings in which landmarks are placed at all 
four corners of k/4 rectangles. Thusrby abuse of notationFwe will also speak of ^(it^T where 
Ri is a rectangle. The localization algorithm will be the same as in Proposition 4.16: perform 
spiral search as in Step 1 of Section 4.3 until p(R') decreases to 1 or a landmark is reached. To 
prove the stated boundrwe must show that for some marking fj,Tth.e robot will travel no more 
than 0( nl ° Rn ) times the length of the optimal solutionFregardless of its starting position. 

In factTwe claim that if fj, is constructed by randomly marking k/4 rectanglesrit will have 
this property with high probability. To prove thisFwe define a numbering ip v of the rectangles 
for each of the An vertices in the environment: ip v will tell the order in which the rectangles 
are encountered when the spiral search algorithm is performed beginning at v (i.e. ip v (Ri) — 1 
rectangles are encountered before rectangle i? 8 Tstarting from v). Say that fj, has Property $ if 

Sn In n 
Vv 3Ri : fi(Ri) > and ^„(iE,-) < — - — . 

The probability that fj, fails to have Property $ for a single vertex v is at most 

^Slnny < e -2in„ = l 
k J n 2 

and so the probability that fj, fails to have Property $ for any vertex is at most -. 

So consider a marking fj, which does have Property $. Regardless of the robot's starting 
locationrit will encounter a landmark by the time b(R') reaches 8n ^, nn ; the result now follows 
from Lemma 4.12. I 

Appendix: Proofs of Claims 4.5 and 4.6 

Recall that a subset U = {ui, . . . ,u k } of the vertices of T induces a periodic path if T(U) 
is a simple path and there is some polygonal path P such that T(ui, u i+ i) = P m ' for natural 
numbers m l7 . . .,m k _i; and U induces a comb tree if T(U) consists of aperiodic path on vertices 

63 



{vi, . . . ,v k } distinct from U together with disjoint paths T(ui, fj)Lall of which are mutually 

congruent. 

Proof of Claim 4.5. Let S denote T(r, x, y, z)YP denote the polygonal path T(r, x) ~ T(y, z)T 

and Q denote T(r,y) ~ T(x,z). Let us suppose by induction that the claim holds for all 

examples in which the total number of edges in P and Q together is smaller than in S (note 

that the result clearly holds when P and Q each consists of a single edge). We consider three 

separate casesLbased on the number of leaves in S. 

Case 1: S has two leaves; suppose that these are r and z (other cases are similar). Then 
since C has no branch- vertices in this caseLthe polygonal paths PQ and QP are congruent. It 
is straightforward to show that this implies PQ is a periodic path. 

Case 2: S has three leaves. This is actually impossible; suppose that xTyTand z are all 
leaves (other cases are symmetric). Then since a; is a lealTP -1 and Q have the same initial 
direction; since z is a lealTP -1 and Q _1 have the same initial direction; and since y is a lealTP 
and Q _1 have the same initial direction. Thus P and Q have the same initial directionLwhich 
implies that r must be a leaf. 

Case 3: S has four leaves. Then PLQLP _1 Land Q _1 all have the same initial direction. 
We eliminate the shortest of these four initial edgesLand shorten the other three initial edges 
by the same amount. In this wayLwe have an example S' with one fewer edge in P or QT so 
the claim holds for S'. This in turn implies that S is a comb tree. ■ 

Proof of Claim 4.6. If all triples form periodic pathsLthen clearly T(U) is a periodic path. 
So suppose T(x,y,z) is a comb tree. Then since T(x,y) has the same direction at both endsL 
T(x, y, z 1 ) must be a comb tree for all z' G U . Now if T(x,p, q) is a periodic pathLthen T(x,p) 
cannot have the same initial direction at both ends; thus T(x,p,q') is a periodic path for all 
q' G U . From this it follows that if any triple constitutes a comb treeLthen all triples do. AlsoL 
we can conclude that all vertices are leaves in T(U). 

NextLobserve that for all z, z' G UTx has the same support point in T(x, y, z) and T(x, y, z') 
(it is the maximal sequence of line segments which is the same at the beginning and end of 
T(x,y)). ThusLit has the same support point in ah comb trees. We now conclude that all the 
vertices in U have disjoint congruent paths joining them to the rest of T(U). Deleting these 
pathsLwe obtain \U\ support points joined by periodic paths. Thus T(U) is a comb tree. ■ 
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Chapter 5 



Real-Time Server Algorithms 



The trade-off between the computational resources of an on-line algorithm and the competitive 
ratio it can achieve is a question that has received relatively little attention. In some senserit 
is not hard to see why this should be the case: proving lower bounds for traditional algorithms 
is notoriously difficultr and introducing the on-line/off-line distinction only seems to add to 
the complications. Thus establishing a non-trivial relationship between two measures such as 
running time and competitive ratio appears to be mainly beyond the reach of current techniques. 

Neverthelessnt seems worth considering the problem at least for the simplest kind of on-line 
algorithms. What we have in mind is the following. Consider an on-line problem that can be 
expressed in the framework set up in Chapter 2: an algorithm A is presented with a sequence of 
requests a = {ci, <7 2 , . . .} in order; when c; is presentedlM must perform some computation. We 
are interested in considering algorithms A that can meet some fixed time bound in processing 
each request; in particularly should not slow down as more and more requests are presented. 
Thusrwe (informally) define a real-time algorithm to be one which uses a constant amount of 
spacer and constant time per request <7j. Of courserthe notion "constant" will depend on the 
particular problemrbut the running time of A should in any case not depend on the number 
of requests seen so far. 

The 2-server problem appears to be a good one on which to try out these notions. First 
of alir a fair amount is known about the bounds one can achieve — there are at least three 
optimal 2-server algorithms in the literature [MMSrCLirCL4] — and a number of constant- 
time algorithms have been analyzed [IKTCL2rCKPV]. Alsorthe structure of the 2-server 
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problem is in some sense quite a bit simpler than the structure of the general A;-server problem. 

In this settingra real-time algorithm is one which decides which server to send to each 
request using an amount of time and space independent of the cardinality of the metric space 
and of the number of points requested so far. None of the known optimal 2-server algorithms 
have this property; for examplerin an infinite metric space such as R d Tthey slow down with 
each additional request. Indeedrcurrently the best real-time algorithm known has a competitive 
ratio of 4 [CL2] (improving on a 10-competitive algorithm [IR]); and until nowrthe only known 
metric spaces in which a fast algorithm could achieve the optimal ratio of 2 were those which 
could be embedded in a tree. 

All of this leads to a very interesting open question: is there a real-time 2-server algorithm 
with a competitive ratio of 2? We do not settle this question here; howeverFwe give two 
results which provide evidence in opposite directions. Note also that the question is essentially 
independent of the A;-server conjectureFwhich asks how well an on-line algorithm can perform 
regardless of the amount of computation it requires. 

First we present a real-timeFmemoryless 2-competitive algorithm for two servers in R d rfor 
any dimension Sunder the L\ ("Manhattan") metric. Fhis result considerably extends the class 
of metric spaces for which there is known to be a fastToptimal algorithm. Moreoverrthe original 
real-time 2-competitive algorithm for trees was based on a very elegant memoryless strategyT 
"Double- CoverageF that was first discovered for servers on a line [CKPV] (this algorithm 
was discussed in Chapter 2). HoweverFattempts to extend this technique to produce optimal 
deterministic algorithms in spaces more general than trees had so far not been successful. Fhusr 
an interesting feature of our main result is that it follows the style of Double- Coverager and 
can therefore be viewed as an extension of that algorithm to the case of two servers in higher 
dimensions. 

In Section 5.2Fwe turn to the question of lower bounds. A number of papers have proposed 
"balancing" algorithms for the A;-server problem. Fhe basic balancing algorithm works as 
follows: for each server s 8 Tits total distance traveled is maintained in the variable _D 8 -; when 
a request is made at a point rrthe algorithm sends the server which minimizes D, + rs, (for 
x,y G MTlet xy denote the distance between them). Fhis rule was shown to be ^-competitive 
for k servers when the cardinality of the request space M is k-\- 1 [MMS]Tand for the "weighted- 
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cache" problemr which includes the paging problem as a special case [CKPV]. 

Howeverrthe algorithm does not achieve any constant competitive ratioFeven for two serversr 
in a general metric space M. Thus it was somewhat surprising that a rule minimizing the 
quantity D, + 2rs 8 - was shown to be 10-competitive for two servers [IR]. A later construction 
showed that this algorithm is no better than 6-competitive [CL2]. 

Hererwe show a new lower bound for the class of balancing algorithms in general. Let 
/ : R + — ► RTand Bj be the server algorithm which does nothing when the request point is 
already coveredrand otherwise moves the server which minimizes D, + /(rs^Twhere D, is the 
total distance traveled by server i. We will describe Bj as a balancing algorithm with cost 
function /. Observe that we make no restrictions whatsoever on the nature of the function /. 

Our main result is a lower bound of (5 + \/7)/2 (~ 3.82) on the competitive ratio of any 
such balancing algorithm for two servers. In view of the 2-competitive algorithms of [MMSr 
CLirCL4]rthis shows that no optimal on-line 2-server algorithm can be expressed as a decision 
rule Bf for any /. To our knowledgerthis represents one of the first lower bounds for on-line 
algorithms based solely on computational resources. 

We use the following notation. For a server algorithm yiriet T(A) denote its competitive 
ratio. If A is c-competitive for some (Twe will say it is competitive; otherwisel 1 we write 
T(A) = oo. As beforer OPT will denote the optimal off-line server algorithml 1 whose cost 
on a request sequence a is always the minimum possible; we sometimes refer to OPT as the 
"adversary" algorithm. 

5.1 Two Servers in Euclidean Space 

Let x l denote the i th coordinate of x G R d . For two points x,y G Redefine T d (x,y) to be 
the closed d- dimensional box (possibly degenerate) with x and y as opposite corners. That isr 
w G R d lies in T d (x, y) if and only if w l is between min(a; 8 , y l ) and max(a; 8 , y l ) for i = 1, . . ., d. A 
point z G R d induces the following partial order < z on R d : x < z y if and only if x is contained 
in the box T d (y,z). It is easily verified that < z satisfies the properties of a partial order. 

First consider the following problem. We have two sets P and QTeach consisting of n points 
in (R d , ii)rand we wish to find a minimum-cost perfect matching between them. The following 
lemma is central to the analysis of the algorithm. 
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Lemma 5.1 For each q £ Q, there is a minimum- cost matching M in which q is matched to 
one of the points of P which is minimal with respect to < q . 

Proof. Consider a matching M 1 in which p' £ P is matched to grand p' is not minimal with 
respect to < q . In this caserthere is some minimal p £ P such that p < q p'; suppose that p is 
matched to some q' £ Q. We can draw a shortest path from p' to q that passes through p; thus 
the matching M in which p is matched to q and p' is matched to q 1 has cost at most that of 
M'. I 

The Algorithm in the Plane 

We first give the algorithm in the planeFand then show how to extend it to higher dimensions. 
We number the quadrants of the plane in clockwise orderrbeginning with the positive quadrant. 
As is standard in analyzing server algorithmsFwe view the request sequence as a game between 
the algorithm and an omniscient adversary. In each round of the gamerthe adversary generates 
a request and serves it optimally (using knowledge of the future requests). The algorithm then 
must also serve the requestT without knowing anything about the behavior of the adversary 
servers or the remainder of the request sequence. 

Assume that the algorithm's servers are at points Si and s 2 rthe adversary servers are at 
points t>! and o 2 Tand a request comes to some point in the plane. Since the L\ metric is 
invariant under translations and 90-degree rotationsFwe can assume that the request is at the 
origin and that the on-line server closer to the requestT^ Tries in the first quadrant. 

FirstTthe algorithm computes the partial order ^Twith respect to the originFon the two 
servers. Clearly s x will be a minimal element. If s 2 is not also minimairwe simply move Si 
along a shortest path to the origin. Otherwiserboth are minimal and we have the following 
cases: 

1. s 2 lies in the third quadrant. Here we move both servers at constant speed along shortest 
paths to the origin until Si reaches it. 

2. s 2 lies in the second quadrant. HereFwe move both servers at constant speed to the 
positive y-axis — if s 2 reaches it firstrwe also move it down the y-axis — until s 2 is no 
longer minimal. 
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3. s 2 lies in the fourth quadrant. This is strictly analogous to the previous case. 

4. s 2 also lies in the first quadrant. Then consider the rectangle r 2 (si,s 2 )rand let y be the 
corner of this rectangle closest to origin. We move both servers toward y until Si reaches 
it; now s 2 is no longer minimal. 

Theorem 5.2 The above algorithm is 2-competitive in the L x plane. 

Proof. We analyze the algorithm's performance using the CDRS potential function [CDRS] 

$ = M2 + 2M min (5,OPT), 

where M m [ n (S,OPT) denotes the value of the minimum-cost matching between the adversary 
servers and the on-line serversFand SiS 2 isT&s beforerthe distance between Si and s 2 . By the 
standard potential-function argumentTthe following two facts will imply that the algorithm is 
2-competitive. 

1. When the adversary movesFpaying (Tit can raise $ by at most 2c. 

2. When the algorithm movesFpaying cTit lowers $ by at least c' . 

The first of these facts holds in any metric spaceFand has been proved in earlier papers [CKPVr 
CL3rCDRS]; the argument is simply that the value of the minimum-cost matching goes up by 
at most c' while the first term of $ is untouched. 

Now consider the second fact. We break the behavior of the algorithm into two phases. 
In the first (possibly empty) phaserboth servers are moving; in the second phaseFonly one 
server is moving. In the second phaserin which one server moves a distance cTthe value of first 
term of $ increases by at most cTwhire Femma 5.1 implies that the value of the minimum-cost 
matching goes down by d. Thus $ goes down by at least dT&s desired. 

When the algorithm is moving both servers (each a distance d') the matching component 
cannot go up (whichever server is matched to the origin is decreasing its contribution to the 
cost optimally). Meanwhiler examining the four cases aboveFwe see that the first term of $ 
goes down by 2d'. Thusr<I> decreases by at least 2rfTand the proof is complete. I 
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Under the Euclidean metricrthe on-line servers can follow the above algorithm and pay at 
most the L\ cost. Meanwhilerthe adversary only has to pay the T 2 costLwhich can be smaller 
by at most a factor of y2- Thus we have 

Corollary 5.3 The algorithm is i l\/2- competitive (~ 2.83,) in the Euclidean plane. 

The Algorithm in d Dimensions 

Consider now the case of two servers in Rounder the L\ metric. As beforerby a translation 
of R d Lwe may assume that the request is at the origin; we also relabel the servers if necessary 
so that Si is closer to the request. 

The algorithm in d dimensions works as follows. If only one server is minimairit moves 
on a shortest path to the origin while the other remains fixed. Otherwiserboth servers are 
minimal. Let Si = (s\, . . . , sf) and s 2 = (s^, . . . , s 2 ) in coordinates; we proceed in the following 
two stages: 

1. As long there is a j for which s[ and s J 2 have opposite signLwe move the servers at the 
same speed towards the hyperplane {x : s J = 0} so as to bring one of these values to 0. 

2. The servers begin the second stage with s[ and s 2 having the same sign for each j; that 
isrthey lie in the same closed orthant. By relabeling the axesLwe may assume this is the 
positive orthant. Let y be the corner of T d (si,s 2 ) closest to the origin; that isL 

y = (miR(s\,s\), . . . , minOf, s d 2 )). 

As beforeL denote the j th coordinate of y by y J . The servers repeatedly execute the 
following step: each picks a j for which s] — y J is strictly positiveLand moves at constant 
speed towards the hyperplane {x : s J = y J } so as to decrease s^ to yK Each such move 
reduces the number of coordinates j for which s\ — yi > for one of the servers s,. ThusL 
after at most d such stepsLserver Si will be sitting on y. At this pointLonly Si is minimalL 
and we proceed as above. 

Theorem 5.4 The algorithm is 2-competitive under the T x metric in R d . 
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Proof. We use the same potential function $. The analysis of the adversary's move and the 
case in which only one on-line server is minimal are handled as in the proof for two dimensions. 
When both on-line servers move a distance dTwe must show that the potential $ drops by at 
least 2d. This will be implied by the following property: 

(*) Both servers are moved a distance d closer to the origin and 2d closer to each other 
simultaneously. 

The motion of the servers in the first stage of the algorithm clearly satisfies this property. 
Observe that throughout the second stagerthere is a shortest path from Si to s 2 that passes 
through the corner y of T d (si,s 2 ). But each step in this stage brings a server simultaneously 
closer to the origin and to y; thus Property (*) is maintained in the second stage as well. I 

Corollary 5.5 The algorithm is 2\fd- competitive under the Euclidean metric in R d . 

5.2 A Lower Bound for Balancing Algorithms 

To prove the lower boundrwe begin by "classifying" the function / used in the balancing rule in 
a certain way. We will say that a property P of positive real numbers holds "e.f." (everywhere 
but in a finite interval) if 3x \/(x > x )P(x). Similarly P holds "a.l." (for arbitrarily large 
reals) if \/x 3(x > x )P(x). For any function / : R + — ► RTand p £ R+Tsay that / £ <f(p) 
if f(x) < px e.f. If for some such pTf £ Lp(p)Twe will write pj = inf{p : / £ (f(p)}; Pj = oo 
otherwise. Observe that / does not necessarily belong to Lp(pf)T since the infimum is not 
necessarily attained. 

Finallyrthe behavior of Bj is ambiguous when D, + f(rsi) = Dj + f(rsj) for i ^ jTand i,j 
both minimize this expression. The standard convention here is to let the adversary break the 
tie; in any eventrnone of our constructions rely on such degeneracies. 

For the remainder of this sectionFall server algorithms will be 2-server algorithms. 

Lemma 5.6 If Bj is competitive, then f{x) > e.f. 

Proof. Assume by way of contradiction that f(x) < a.l. and that Bj is c-competitive for 
some c. Choose x so that f(x ) < and N large enough so that N > cx and N > —f(x ). 
Since f(x) < a.lTwe can find an N > N such that f(N) < 0. 
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Figure 5-1: Metric Space Mi 

Consider the metric space Mi in Figure 5-irwith WX = YZ = x TWY = WZ = XY = 
X Z = N . Server Si begins on W and server s 2 on X . The first request is to the point Y. Since 
both servers are the same distance from YTwe may assume that Si responds. Now we request 
point Z; since 

N + f(x )>0>f(N), 

server s 2 wih respond. At this point r_Bf has paid 2N > 2ca; rwhile OPTTby using one server 
to cover requests to W, X and the other to cover Y, ZFneed pay only x . Moreover Mie situation 
is now symmetric to the beginningFso we can repeat the process indefinitelyr contradicting the 
claim the Bj is c-competitive. I 

Armed with this lemmalwe can prove a much stronger result Tgeneralizing the observation 
that the basic balancing algorithm is not competitive. 

Lemma 5.7 If Bj is competitive, then pj > 1. 

Proof. Assume by way of contradiction that pj < 1 and Bj is c-competitive. We have two cases 
to consider: / £ vK-Q an d / ^ ^(l)- 

If / G <^(l)rthen for some x > OrO < f(x) < x for all x > x . Consider metric space MiT 
with WX = YZ = x Tall other lengths equal to N = ca; rand the servers initially on W,X. 
Suppose we request point Y and Si responds. If we now request point Zrthen since 

N + f(x ) > N > f(N), 

server s 2 will respond. Moreoverrthis process can be continued indefinitely. Thus Bj pays 2cx 
in each round of this sequencer-while OPT need pay only a^Tas in the above arguments. 

If / g - <^(l)rthen we can still find some y > such that < f(x) < (1 + -^)x for all x > y. 
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MoreoveiTsmce / ^ Lp(l)Twe can find some x > y such that f(x ) > x /2. Construct metric 
space Mi with WX = YZ = a; rall other lengths equal to c^oTput Si on IFT^ on XTand 
request point Y . Suppose Si responds; we now request point Z. Since f(x ) > x /2T 

cx + f(x ) > (c + -)x = (1 + — )cx > f(cx ), 

the last inequality following since cx > y. Thus s 2 will respond. In this wayri?^ pays 2cx on 
this sequencerwhile OPT need only pay x . I 




Figure 5-2: Metric Space M 2 

Lemma 5.8 If p f = p > 1, then T(B f ) > p/2 + 3/2. 

Proof. We show that for each (3 > Orthere exist arbitrarily long request sequences a for which 
Bf(a) > (p/2 + 3/2 — (3)OPT(a); this will establish the lemma. We construct a countably 
infinite metric space M 2 as follows. The "core" of the space is a short segment XY of length d T 
chosen so that pd /2 < f(d ) < 2pd . The sequence a is built in phases; in phase jTj = 1, 2, . . T 
all the requests are to XTYTand a point Zj at distance dj from both X and Y. (See Figure 5-2.) 
Let Gj denote the sequence up through the end of phase j. 



73 



Let Dj denote YUZo <^Tand D 1 - denote the sum of Dj and the total distance traveled by 
both on-line servers in all previous phases. We fix some very small a > Orto be determined 
belowrand since pj = pTwe can choose dj such that Dj < ac^Tand f(dj) > (p — 0/2)dj. Phase 
j begins with the two on-line servers sitting on X and Y . First the point Zj is requested; then 
the points X and Y are requested repeatedly until the on-line server at Zj returns to the core 
XY; at this point the phase comes to an end. 

In this phaserone adversary server moves out to Zj and immediately returns on the next 
requestTfor a cost of 2dj. Meanwhiler the on-line server covering X and Y has built up a 
distance of no more than Dj from all previous phasesFso it must move more than 

dj + f(dj) - (Dj + f(d )) > dj + (p- fi/2)dj - (adj + 2padj) 

= (p+ l)dj - (2p + l)adj - (]3/2)dj 

before the server at Zj is selected to return. Fhusrthe on-line servers move a distance of at 
least 

(p + 3)dj - (2p + l)adj - (]3/2)dj 

in this phaser while the adversary servers move no more than 2dj + 2adj in all phases up to 
this one. By choosing a small enoughrwe can ensure that 

Bf(<Ti) (p + 3)-(2p+l)a-(/3/2) 

, A V > KF ' — v F ' V-l—L > p 2 + 3/2 - 3. 

A{(jj) ~ 2 + 2a ~ F/ ' ' 

Since we can continue this construction for an arbitrary number of phasesrthe result follows. 
I 

It is clear that a very similar construction involving metric space M 2 shows that if pj = ooT 
then Bf is not competitive. The final lemma is based on a construction in [CL2]. 

Lemma 5.9 If Pf = p > 1, then T(Bf) > 3p/(p — 1). 

Proof. In the style of the previous proofTwe show that for each > OT 
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Figure 5-3: Metric Space M 3 



Choose positive 8 < (3/2 and 



f <2£^.«. 



4p- 1 

Now take N large enough so that f(x) < (p + e)a; for all a; > N andrsince pf > p — eTwe can 
find a > N such that /(a) > (p — e)a. 

Consider the metric space M 3 of Figure 5-3. Triangles UVW and XYZ have sides of length 
a; the three edges crossing between the triangles have length 



b= (*E±±-6)a. 

\2p-2 



Put server Si on U and s 2 on V. We place the adversary servers on V and Z. AlsoFset 
Di = a/2 and D 2 = (this can be easily accomplished by starting one of the on-line servers 
from a seventh point in the metric space). 

We construct a request sequence as follows. First we request ZFwhich will be served by s 2 . 
We then request X -- as shown belowrthis will be served by Si. At this pointr_D 2 exceeds D x 
by a/2rand the situation is symmetric to the beginning. The optimal algorithm can use one 
server to cover each triangle and hence pays a; Bj pays 26 + cTfor a ratio of 



2b + a 2p+l 3p 

= 1 + -28 > - 

a p — 1 p — 1 



(1. 



Thus we need only verify that Si will serve the request at X . This follows because 



JK ' 2 J \\2p-2 J J ~ V 2p-2 F 2p-2 
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which is less than or equal to 

D 2 + f(a) = a + b + f(a) > a ^±^1 _ S - e ) . 

I 

Theorem 5.10 For all f, T(B f ) > (5 + y/1)/2. 

Proof. If pj < 1 or pf = ooTthen Bj is not competitive. In the case where 1 < pj < ooT 
Lemmas 5.8 and 5.9 imply that 

im)>max{^^,^±^}. 

This expression is minimized at pj = 2 + v^rwith a value of (5 + \/7)/2. I 
As noted aboveFwe have the following interesting corollary. 

Corollary 5.11 No optimal on-line 2-server algorithm can be expressed as Bj for any function 
f- 
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Chapter 6 



Conclusion and Open Problems 



6.1 Robot Navigation 

We have considered two basic on-line navigation problems: searching for a goal in an unknown 
polygonrand determining one's location in an environment for which the map is known (local- 
ization). A number of questions are left open by this work; we summarize some of these belowr 
along with a number of other possible directions for future work. 

1. To begin withrthere are gaps between the upper and lower bounds that we can show for 
the localization problem in both kinds of environments considered in Chapter 4. Certainly 
for the case of rectangular obstaclesrit seems that there should be room for improvement 
on the upper bound. Indeedrit is quite possible that essentially the same algorithm given 
here actually achieves a much better competitive ratio; it appears that a stronger analysis 
of the geometry of the rectangle packing is needed. 

Alsor as suggested in Section 4.4T the problem of placing landmarks brings up some 
interesting algorithmic questions. Here is one concrete decision problem: given a tree 
T and an integer AT does there exist a A;-marking fj, and an accompanying localization 
algorithm with competitive ratio at most 9? (9 appears to be the right constant to use 
hereras it is the best competitive ratio attainable for the search problem on a line.) 

2. The results of Chapter 3 apply only to the case of simple rectilinear polygons; i.e. those 
with no holes. [DKP] conjectures that a constant competitive ratio should be attainable 
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for the problem of exploring a rectilinear polygon containing an arbitrary number of 
rectilinear holes. (They give an algorithm which is 0(A;)-competitiver where k is the 
number of holes.) 

Since a number of variations on this problem are possibleriet us make precise what we 
mean: we want the robot (whether on-line or off-line) to traverse a path from which it can 
see all points in free space and on the boundaries of the obstacles. This question appears 
to be rather difficult (and the off-line version is NP-Completerby a simple reduction from 
the Traveling Salesman problem); perhaps a good starting point would be the special case 
of a large square containing an arbitrary number of unit squares as obstacles. 

Of courserone could also consider the search problem in this setting; it is not clear in this 
case what kind of bounds one would be trying to achieve. 

3. The problem of using multiple interacting robots effectively is one that comes up fre- 
quently in the larger robotics literature. Most of the previous theoretical work on multiple 
robots is somewhat orthogonal to these concerns; it has tended to deal with computation- 
ally limited robots that use one another mainly for "pebbling" purposes (see e.g. [BK] 
and the references therein). 

We are interested in the following sort of problem. One has k independent robots trying 
to solve a search problem in some geometric environment; the quantity being minimized 
in this situation is the time taken to find the goairrather than the total distance traveled. 1 
If the best competitive ratio attainable by a single robot in this environment is (Tcan one 
give algorithms which are 0(-7^r)-competitiver where f(k) is a reasonably fast-growing 
function? 

This question resembles some of the load-balancing problems that have been considered 
from the perspective of competitive analysis; essentially we are asking how "parallelizable" 
the search problem is in the on-line setting. 

4. Much of the appeal of Klein's street problem (Chapter 3) is in the definition of a street 
itself: it is a natural type of polygon in which a constant competitive ratio can be achieved 



For a single robot, time and distance tend of course to be interchangeable in most of these problems. 
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for the search problem. We are interested in looking for other types of polygons in which 
competitive search is possible. 

One obvious possibility is the class of star-shaped polygons (P is star-shaped if there is 
some point inside P that can see the entire boundary): can a robot starting at s find 
a point t while traveling a constant factor times d(s,t)1 A related and probably easier 
question is that of whether a constant competitive ratio can be achieved for the problem 
of finding a member of the kernel of P (i.e. the set of "star points" of P). 

Of courserstar-shaped polygons are a rather simple classrbut studying them may suggest 
more general algorithms. For exampler suppose we define the illumination number of 
P to be the minimum number of point-source "light bulbs" needed to provide direct 
illumination to the entire boundary of P from its interior (so P is star-shaped iff its 
illumination number is one); now we can look for algorithms that achieve a constant ratio 
for searching polygons with fixed illumination number. 

5. What does it take to build a maze? Specificallyr it is interesting to ask oneself what 
constitutes the difference between the "shortest-path" problems of [PYrBRaScFBBFY]r 
and the geometric search problems [KTK1] of the type discussed in Chapter 3. Initiallyr 
one is tempted to conclude that the difference is in whether the coordinates of the goal 
are known; butTas any good labyrinth-designer knowsrit is possible for the entrance and 
exit of a maze to be physically right next to each otherFand for the maze still to be very 
difficult . 

In other wordsFwhen the obstacles can be sufficiently complexrthere is no value in knowing 
the coordinates of the goal. We can make this more precise as follows. Fet T denote a 
family of rectilinear polygonsFwhich will be used as obstacles in the sense of [PY] (e.g. T 
could be the set of all rectanglesFor the set of all F-shaped polygonsFand so on). As 
usuairwe will require that all members of T have at least unit thicknessFand that they 
cannot be "glued together." Consider now the navigation problem in which the plane is 
filled with members of .7-Tand a robot starting at s is given the coordinates of a point t 
which it must travel to. Fet n denote the straight-line distance from s to t; recall that 
this could be much less than d(s,t)Tth.e length of the shortest obstacle-avoiding path. 
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Say that T is maze-inducing if for no function / is there an /(n)-competitive on-line 
algorithm; that isrthe competitive ratio for the navigation problem cannot be bounded 
in terms of the straight-line distance from s to t. Thusrthe results of [BRaSc] show that 
the class of rectangles is not maze-inducing. On the other handHt is not difficult to build 
mazes when T is the set of all rectilinear polygons (with at least unit thickness). So 
a natural question would be to try giving some characterization of classes T which are 
maze-inducing. For exampleris it true that no T consisting of polygons with a uniformly 
bounded number of verticesFor even a uniformly bounded illumination numberFcan be 
maze-inducing? Along these lines one could definer by analogy with the illumination 
number rthe segment number of P to be the minimum number of line- segment light sources 
needed to provide direct illumination to the entire boundary of P from its interior. We 
can show it is possible to build mazes when T is the set of polygons with segment number 
at most threerbut do not know whether it is possible when the segment number is at 
most two. 

Of courserthere are many other issues one could consider. As mentioned at the beginning 
of Chapter 4Fmost on-line navigation problems studied up to this point have maintained a 
fairly binary distinction between having a map and having no map. But many naturally arising 
robotics problems involve navigation when the robot has some limited information about its 
relation to the environment. Fhe localization problem is a step in this directionFas are the 
papers of Donald [Don] and Blum and Chalasani [BC]. 

As in the case of several other on-line problemsr there are numerous "design choices" one 
faces when formulating a navigation problem of the variety studied here. We believe it is 
important in this process to keep in mind the problems faced by designers of autonomous 
mobile robots. It is not necessary that the theoretical algorithms produced in this setting be 
directly implementable in contemporary robots; but they seem to be most valuable when they 
provide some insight into the structure of navigation problems faced in real-world applications. 
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6.2 Servers 

We have addressed the notion of real-time 2-server algorithms — those that only use only 
a constant amount of spacer and constant time per request. It is well-known that the best 
competitive ratio attainable by a 2-server algorithm is 2rand this can be achieved by several 
algorithmsrall of which perform a substantial amount of computation on each request. We have 
shown a 2-competitive real-time algorithm in n dimensionsrfor every nFunder the Manhattan 
metric. On the other handrwe have shown that the class of balancing algorithms (a subset of 
the real-time algorithms) cannot achieve a competitive ratio better than 5+ 2 in general. 

There are some immediate questions one could consider in light of these resultsFas well as 
a more global one. SpecincailyFcan our algorithm for the Manhattan metric be adapted to the 
standard Euclidean metric while maintaining the competitive ratio of 2? Can it be extended 
to the case of three or more servers? (We can show that most natural generalizations in this 
direction are not 3-competitive.) It would also be interesting to try improving the upper bounds 
for 2-server balancing algorithms; the lower bound we show in Chapter 5 is still below the best 
known competitive ratio for a general real-time algorithm. 

But at a higher leveir we believe the fundamental problem here is to decide whether or 
not there is a real-time 2-server algorithm with a competitive ratio of 2. We do not venture a 
conjecture; the results of the previous chapter present evidence for both possible answers. Of 
courserbne could ask the analogous question for more than two serverslwhere all these problems 
become much murkier . . . but the main point is this: lower bounds for on-line algorithms have 
typically been based simply on information-theoretic arguments alone; that isFregardless of how 
the algorithm makes its decisionsrthere will be input sequences on which it performs badly. The 
hope is that as we begin to understand this area betterFwe can also begin making distinctions 
based on the computational resources an algorithm has at its disposal. 
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